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ANNOUNCEMENT 





NEW PUBLICATION POLICIES 


This issue marks the beginning of our 11th year of publication, volume X having 
completed our first 10 years in October 1954. The years between 1945 and 1954 were 
busy and important ones for clinical psychology which can truthfully be said to have 
left adolescence and entered healthy young maturity. The impetuous progress from 
World War II has survived very well the difficult transition into peace-time utiliza- 
tion at full capacity. Our young profession has accomplished within a decade what 
older professions took scores of years to achieve, and has withstood this telescoping 
of its youthful development without too much scarring from mistakes or false starts. 
All of the credit for this administrative and promotional development should go 
individually to the officers and members of the AMERICAN PsycHOLoGicaL Asso- 
CIATION who have labored so selflessly te build solid foundations. More has been 
accomplished in healthy ways than could ever have been expected in 1945 when our 
first directions were just being plotted. Division 12 has grown to be one of the 
largest, most respected and influential divisions of the APA, and its committees 
have made great progress in advancing its purposes using research approaches. The 
AMERICAN Boarp oF EXAMINERS IN PROFESSIONAL PsYCHOLOGY was speedily 
launched and is rapidly fulfilling its promise in standardizing the requirements lead- 
ing to certification. A review of the issues discussed or advocated in the editorial 
pages of this Journal reveals the number of problems which have been faced and 
solved. Although not always supported in our editorial opinions by all our readers, 
we feel some sense of accomplishment if we have in any way helped to stimulate 
some of the things which have come about. 

The next decade brings with it new phases of development of our profession 
which necessitate a review of the objectives which this Journal should seck to attain. 
Our specialty now has a smoothly functioning division in the APA with its own com- 
mittees to handle important problems as well as a newsletter to inform members 
what is being done. Matters relating to the establishment of adequate training fac- 
ilities, the handling of interprofessional relationships, organizational and promo- 
tional activities, certification and licensure, public relations, and the advancement of 
research are now being well handled by our duly elected representatives in the APA 
so there is less need for private agencies to be so forward in pioneering the road ahead. 
We can leave such matters to those whose responsibility it is to handle them. In the 
past, the Editor of this Journal with the support of the Editorial Board has conceived 
it to be one of our functions to act as a loyal stimulant to official organizations by 
bringing issues out into the open by pointing out things which need to be done. We 
shall continue to do so in the future, if need arises, but less vehemently because 
proper channels and mechanisms for the solution of professional problems have now 
been established. Editorial opinions in the future will be more concerned with 
critical evaluation of research and theoretical developments in clinical psychology. 


TRENDS IN PsyCHOLOGICAL JOURNALISM 


The last ten years have witnessed marked changes in publication policies of 
official and private psychological journals. Our first issue in 1945 contained only 7 
scientific papers averaging 11.9 pages in length. The last issue of this period, ap- 
pearing in October 1954, included 22 scientific papers averaging 3.9 pages in length. 
One explanation of this trend is offered by Girden and Dennis! who point out that 
the number of publications by new APA associate members averaged only 18 in 1946 
but jumped to 1478 in 1953. In 1945 it was our editorial experience that relatively 
few papers were being submitted by clinical psychologists to a relatively large num- 
ber of publication outlets. The official journals have always been the first choice of 
publication outlets for many authors, and on numerous occasions the private journals 


1GIRDEN, EpWARD AND Dennis, WAYNE. Publication trends in American Psychology—A five 
year extrapolation. Amer. Psychologist, 1954, 9, 632-635. 
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have not had a sufficiently large number of publishable manuscripts to select from. 
During several periods of the last ten years, your editor has had to resort to symposia 
and invited papers in order to secure a reasonably high calibre of papers to publish. 
Our policy has been to give the highest priority to the better research studies sub- 
mitted and then to fill up issues with as many papers of lesser quality as could be 
included. Some papers of questionable quality have been accepted for publication 
because nothing better was available when the publication deadline for an issue 
arrived. Even so, we have always assigned the highest priorities to actual research 
studies and have consistently rejected purely theoretical or speculative papers un- 
less they happened to deal very provocatively with issues of general professional 
importance. We have been rather surprised at the relatively few clinical case studies 
and reports of clinical experience from clinics or institutions which have been sub- 
mitted. 

Another factor relating to the length of papers published has been a change in 
editorial policy adopted by most journals in seeking to limit the length of papers 
accepted for publication. It is now more generally recognized that few research 
papers are of such importance as to be assigned the space devoted to them 10 or 20 
years ago. The large majority of papers submitted for editorial consideration are 
actually minor studies which do not deserve major space allotments. Only a few 
research studies in any year represent major contributions justifying over-detailed 
reporting. As interpreted by this editor, part of the trouble stems from zealous 
laboratory instructors in advanced expe1imental psychology who require graduate 
students to write voluminous laboratory reports summarizing the literature and the 
findings in minute detail. When such students begin to contribute to the literature, 
they naturally feel that research reports should be written in the same detail as 
required of laboratory reports when such actually is not the case. In the journals of 
the older physical sciences such as physics or chemistry, some of the most significant 
papers ever published have been very brief and condensed. It is not necessary for 
authors to review all the literature, but only to summarize papers relating specific- 
ally to the problem under study. It is not necessary to include well known statistical 
formulas or to report raw data or mathematical computations in great detail since 
most readers are willing to grant that the author is competent to work up the data 
using standard methods without the necessity of reproducing all operations. It is 
not necessary to report the results of statistical analyses in duplicate form in text, 
tables and graphs however valuable such multiple presentations may be for ad- 
vertising purposes. Finally, the data may be allowed to speak for themselves with- 
out indulging in intensive speculations or rationalizations in the discussion to ex- 
plain why negative results were obtained or to resolve differences with other studies. 
A concise statement of problem, method, results, summary and conclusions is usually 
sufficient to communicate all that needs to be known to the reader. If any reader is 
interested in learning about the study more thoroughly, he can consult with the 
author in order to receive the data in greater detail. 

The policy of this Journal has been to try to strike a fair compromise between 
minimum requirements of conciseness and readability in order to communicate the 
findings to sophisticated professional readers and the maximal allotments hoped for 
by most authors. It is understandable that authors wish their work to be reported 
in the greatest possible detail and with complete discussions of all pertinent issues. 
All of us want our brain children to be decked out as lavishly as possible. But the 
law of diminishing returns begins to operate with respect to the amount of space 
which can be allotted to any one author. All publications have obligations to read- 
ers as’well as authors. Many young authors are too conscientious in “‘spelling out’’ 
descriptions of tests and methods which are actually well known to their sophisti- 
cated professional audience. 

Our readers are invited to submit to the editor at any time their suggestions or 
criticisms concerning how this Journal could be improved. Our aim is to conduct 
this Journal in a manner conducive to serving the needs of the greatest number of 
clinical psychologists We also invite correspondence from readers relative to pertin- 
ent issues and including suggestions for inclusion under Editorial Opinions. 
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ForMAT OF PAPERS 


With the current popularity of evaluating the “centeredness” of relationships, 
attention may be given to the purposes and objectives of scientific journalism. The 
primary purpose, of course, is research-centered, to communicate research data as 
expeditiously as possible. But there are also many other practical considerations 
reflecting the different interests of authors and readers. Authors desire to present 
their work as favorably as possible in order to win scientific recognition, strengthen 
academic prestige and tenure, establish a reputation of productivity and carefulness, 
and otherwise demonstrate research competence and industry. The importance to 
authors of such motives is revealed in behaviors such as breaking up research results 
in serial parts to produce as many papers as possible, publishing essentially the 
same data in slightly different form in several journals, stretching papers out in 
length to make them appear more weighty, and engaging in interminable contro- 
versy in published corrrespondence over differences of opinion. All this may strength- 
en an author’s ego and his academic repute, but it does not improve the quality of 
journalism. There have recently appeared in print a number of defenses of long 
papers as necessary in order to present data and important methodological points 
in the completest detail but these appear to be rationalizations of more personal 
motives. Our impression is that some o: the weakest papers are written at the great- 
est length with lengthy speculation concerning why more positive and significant 
results were not obtained. 

From the readers’ viewpoint, an entirely different set of needs become apparent. 
The average reader has only a limited amount of time for covering the literature 
and wishes to invest reading time with maximum productivity. Also, most readers 
have special interests and wish to concentrate in some areas and skip lightly over 
others. We predict that an actual study of what the average reader does with his 
journals would reveal that he glances quickly over the index to see if anything is of 
interest, quickly peruses the introduction and summary of interesting titles to see 
if they are worthy of further study, actually reads a couple of major papers in detail, 
and gives little or no attention to papers of minor interest. In this process, few 
readers read a journal from cover to cover, a majority skim it over for articles of 
major interest, and a few readers just toss the journal number onto the growing pile 
which marks the fact that he has a professional library. Whatever authors may hope 
for, the fact seems to be that most readers will invest only a certain amount of time 
on the literature, and research papers may be overlooked completely unless they are 
able to attract the reader’s attention by a stimulating title and by holding his at- 
tention through making the paper readable enough so that the reader can grasp the 
essential facts quickly and easily. If a paper looks too formidable or difficult, many 
readers simply pass it by. It goes without saying that if a reader can pick up all the 
essential facts in five minutes from a paper that has been boiled down to concise 
outline form, he will get much more than if he has to pore over the paper for 30-60 
minutes in order to grasp the terminological complexities and methodological fine 
points which appeal to authors. An author gains nothing if the subscribers don’t 
read the paper. 

In this editor’s opinion, there are very few research studies whose results can- 
not be summarized in a few short paragraphs. The reader usually leaves a paper with 
a capsule summary that the author studied such and such a problem, using certain 
methods, and reached this or that conclusion. Only a few readers are so specially 
interested in any one paper as to require more detailed information which can always 
be obtained from the author. What most readers seem to want is a broad overview of 
the findings of the most significant research in the field. 

Medical journalism is older than psychological journalism and has undergone 
a drastic revolution in the last 20 years. Gone are the old type journals in which 
a single issue contained only a few lengthy articles requiring a long night to peruse. 
Now in vogue are short concise writeups which can be read and grasped in a few 
minutes without loss of any important information. This is achieved by intensive 
editorial revision which boils the article down to its kernal. Interestingly enough, 
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such advances have been largely made by private medical journals, with the official 
publications lagging far behind in trimming their sails to modernity. 

The policy of this Journal is to sponsor short concise readable research reports 
which convey the basic facts as briefly and succinctly as possible. This can usually 
be accomplished by presenting the basic facts in outline form covering statement of 
the problem with a short review of previous research, a terse statement of method, 
a concise analysis of results, and a brief resume of the whole paper in the summary 
and conclusions. The data should be allowed to speak for themselves. Among the things 
which could well be omitted are the standard cliches explaining why different samp- 
ling probably explains why the results of the present study are not in agreement 
with other reports, lengthy rationalizations of negative results, and wordy specula- 
tions about what might have happened if things had been different. 

Although some authors have been resentful and critical of our efforts to stim- 
ulate better journalism, most authors have spontaneously commented that their 
revised efforts were actually much more readable and better presentations. This 
Journal continues to accept the Publication Manual of the APA as the standard for 
style, abbreviations, captions, arrangement of figures and tabular materials, head- 
ings, references and punctuation.? In spite of frequent exhortations to authors 
to follow standard style and arrangement closely, at least 50% of papers submitted 
require extensive revision of all parts of the paper including bibliographical refer- 
ences. Much time would be saved and publication advanced if authors would follow 
our style closely without being prompted. 

This policy of requiring concise report writing does not mean that we are en- 
couraging authors to break research up into segments for serial reporting. In con- 
junction with APA editorial policy, we cannot accept papers with titles indicating 
that they are part of a series. Research must be published as a whole. We also accept 
papers with the specific understanding that alternative writeups of the same data 
are not being submitted to other journals. 


PUBLICATION Costs 


Few authors comprehend how much of a financial investment is made by a 
journal when a paper is accepted for publication. In addition to high printing costs, 
every journal has an high administrative overhead even if editorial costs are not 
considered. If strict accounting procedures are applied to overall publishing costs, 
it wili be found that investments running up to hundreds of dollars are given to 
authors when an editor accepts their papers. The official APA journals are able to 
make large investments in single papers because (a) some parts of the costs may be 
subsidized by members at large, (b) the APA central office handles manufacturing 
and distribution problems, (c) larger subscription lists, and (d) editors serving with 
only modest honoraria. Private journals must maintain financial solvency in order 
to insure continued publication, and must amortize all editorial, manufacturing, 
administrative and distribution costs within a strict budget. All this is becoming 
increasingly difficult as journal publication becomes big business.. Operating a major 
journal now requires full time attention of an editorial and clerical staff if all matters 
are to be dealt with efficiently and expeditiously. Editorial responsibilities are such 
that former informal arrangements are no longer satisfactory, and editors should be 
recompensed in direct proportion to their investment of time and ability. All this 
means that detailed attention must be given to the point of diminishing returns in 
making publication space commitments to authors. 

The policy of this Journal has been to grant authors a gratis allowance of $50.00 
toward printing charges and to bill the author for any excess space at cost. This 
arrangement with authors covers printing costs only and makes no allowance for 
other operating charges. This policy frequently has been waived for very important 
long papers in which authors were unable to finance costs, but has been applied 
rigorously to minor papers where the decision to utilize extra space is the preference 
of the author. Analysis of actual charges made in the past reveals that actual print- 


*Publication Manual of the American Psychological Association. Psychol. Bull., 1952, 49, 1-61. 
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ing costs have been substantially larger in most cases than billings to authors. 
The gratis allowance usually permits authors to have up to four full pages of text, or 
two to three pages if there are extensive tabular materials, without excess costs. We 
feel that this policy is fair both to authors and readers. Most papers, even including 
major researches, can be adequately reported in this space if the author is concise 
and succinct in presenting an informative outline of problem, method, results and 
conclusions with a minimum of reviewing of the literature and speculative dis- 
cussions. We will make special provisions in the future for publishing longer papers 
without additional charges to authors where no means of subsidizing excess costs 
can be found. We will also publish minor papers, not otherwise acceptable because 
of publication priorities, where the author is willing to assume all printing charges. 


PUBLICATION LaG 

Beginning in 1955, we hope to eliminate publication lag completely. In the past, 
many editors have deliberately maintained a lag of 12 to 24 months in order to have 
on hand a supply of manuscripts at all times. Such a deliberate delay seems com- 
pletely unjustified and we plan to eliminate it by selecting sufficient papers in any 
three months period to fill the next issue, and returning the remainder to authors 
immediately so that there will be no time lost in their arranging for publication else- 
where. This means that we shall accept the highest priority articles for immediate 
publication in order of receipt during each quarter. The time lag from receipt of arti- 
cle to date of publication will be no shorter than 3 months and no longer than 6 
months. Articles cannot be published in less than 3 months because this is the time 
required to process an issue. In this manner we hope to speed up publication time 
greatly for authors of acceptable papers with no extra charge made for immediate 
publication. In the event that an important paper is received after the quarterly 
quota is filled, it will be retained for inclusion in the issue of the succeeding quarter 
with no more than three months delay. 


CALL FOR PAPERS 


We are still not receiving papers with as wide a variety of content as desired. 
In April 1953 we printed a notice to readers outlining some ideas for papers we would 
like to print. Every editor becomes aware of hiatuses in the field indicating areas in 
which research should be stimulated. We hereby invite authors to submit papers in 
the following areas: 
Evaluation of Affective-Impulsive Status. The development of objective criteria for the 
definition and measurement of emotional status. 


Measurement of Control Mechanisms. Methods for the evaluation of voluntary self-control 
are urgently needed. 


Evaluation of Ideological Composition. Study of the content of thinking with particular refer- 
ence to ideological status. 


Studies of Attitudinal Psychopathy. Methods for demonstrating the core attitudes and con- 
stellations of attitudes which characterize a person. 


Quantification of “Style of Life’. How to discriminate the basic attributes of life style. Also, 
the influence of life style on maladjustment. 


Studies of Group Membership, Role-playing and Situational Determinants. Categorization of 
situational maladjustments, their etiology and resolution. 


Studies of Rare Clinical Syndromes. We need more extensive case studies of personalities i in 
the border area between normality and mental disorder. This would include “eccentrics” and 
persons with unusual character structure rather than the standard psychoneurotics and psychotics, 


Operational Case Study Methods. Illustrating the integration of data from all available 
operational levels. 


We would like to increase our volume of papers submitted by at least 50% and 
covering a wider range of topics. Authors are invited to submit papers for immediate 
consideration. Our new policy of immediate publication cannot be maintained unless 
a dependable flow of suitable manuscripts is maintained. We hope that authors 
desiring immediate publication will submit their manuscripts in the near future. 
Our usual policy is to accept or reject papers within two weeks after receipt. 


y U.S. 





SCOPE AND LIMITATIONS OF THE GOODENOUGH DRAWING TEST 
METHOD IN CLINICAL WORK WITH MENTAL DEFECTIVES 


HERBERT C. GUNZBURG 


Monyhull Hall, 
Kings Heath, Birmingham, England. 


INTRODUCTION 

Drawings of a “Man” have frequently been used for estimating the mental 
level of children and subnormal or abnormal adults. Correlations with objective 
intelligence tests have at times been high and have supported Goodenough’s original 
work), On the other hand there is much evidence that correlations are often so 
low as to be of little help in practical work and hence the drawing has been frowned 
on as an emergency test for quick assessment of mental level. 

A more detailed study of the published work suggests that the low correlations 
tend to occur in samples of abnormal composition and that otherwise correlations 
are generally satisfactory when obtained from similar samples as Goodenough’s 
original group. The results of some investigations seem to suggest that the Good- 
enough score is usually indicative of mental level only in those cases where mental 
functioning is not affected by neurotic or psychotic factors. It appears that correla- 
tion coefficients will vary greatly, depending on the number and strength of abnormal 
factors in a given group and little significance can be attached to findings obtained 
from psychologically heterogenous groups even though they may be homogenous in a 
legal sense, e.g. certified mental defectives. This makes it advisable to avoid using 
the drawings by known neurotic or psychotic persons for estimating the intelligence 
level, though the MA obtained therefrom may happen to agree with estimates from 
other sources because these too may have been affected by the pathological factors. 

These observations relating to the literature on the subject are supported by the 
results of the investigations summarized in this paper. In particular they demon- 
strate: 

a. The comparatively high correlation between an established intelligence test 
and the drawing of a man scored on an abbreviated Goodenough scale 7f the popu- 
lation is fairly free of pathological factors; the comparatively lower correlations 
existing in “pathological’’ populations. 

b. The use of certain criteria in the drawings to differentiate between drawings 
indicative of intelligence level and others where no such estimate should be attempt- 
ed. These criteria may also indicate pathology of some kind which will require furth- 
er clinical investigation. 

c. The possibility of predicting with reasonable accuracy the probable Wech- 
sler non-verbal 1Q, and MA, from “non-pathological” drawings. 


METHOD 


All drawings used in the present investigations were obtained in course of psy- 
chological routine examinations of certified and institutionalized mental defectives 
of both sexes. They are not a representative sample of the institution population 
because the younger male age group had been tested more widely and frequently in 
course of other research projects. The conclusions drawn in this paper may there- 
fore have particular significance for this group only and caution must be exercised 
in their application to older subjects and to women. 

The drawings of 344 certified adult defectives have been used, 236 men and 108 
women. All drawings were scored on a shortened Gcodenough scale of 20 items pub- 
lished by Berdie“ using the original Goodenough criteria (items: 4c, 6a, 6b, 7d, 8a, 
8b, 9b, 9c, 9d, 10c,1la, 11b, 12b, 13, 14f, 15a, 16b, 16c, 17b, 18a). Berdie found a 
correlation of .97 between the shortened and the long scale and this has been con- 
firmed by the present writer in a sample of 80 defectives“. This shortened scale is 
convenient in practical work and all Goodenough scores mentioned in this paper 
relate to the shortened scale. 
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Three drawings were obtained from each subject: “A person”; “A man” or 
“a woman” being the opposite of the sex of the first drawing; ‘““A man’ drawn with 
colored pencils. The usual instructions as recommended by Goodenough were given 
with the following addition before commencement of the second and third drawing: 
“Draw a man (woman) and make it quite clear it is a man (woman) and not a woman 
(man).” Before the beginning of the third drawing a range of colored pencils was 
offered with the explanation that the drawing of another man was wanted but “‘it 
need not be the same man . . . you can draw a different man if you like. You must 
only use these colored pencils. . . use as many as you like. . . use whatever colors you 
like . . . please yourself.’”’ The latter part of the instructions was repeated twice be- 
fore commencement of the drawing. 

The correlation coefficient of the 344 Goodenough scores, using the better draw- 
ing of the two “men” obtained in the session, with the Wechsler non-verbal IQs was 
.54. This coefficient, while indicating the existence of a significant relationship is 
rather inadequate for practical purposes and unreliable in estimating the true cor- 
relation in the mentally defective population generally since the drawing is extremely 
sensitive to the presence of pathological factors®. The size of the correlation appears 
to be directly dependent on the proportion of ‘‘pathological”’ and ‘‘non-pathological”’ 
defectives in the sample and is merely an expression of the local conditions at the 
time of testing. This was demonstrated in the following experiment. 


THE GoopDENOUGH ScorE oF “‘PATHOLOGICAL”’ AND ‘‘NON-PATHOLOGICAL”’ 
POPULATIONS 


Two samples of 40 cases each were matched carefully for age, sex, and Wechsler 
IQ’s (Full, Verbal, Non-verbal). Statistical analysis of these groups of young men 
between ages 16 to 26 and with comparable intelligence levels indicated that what- 
ever small differences may have existed between the two groups they did not amount 
to statistical significance (Table 1). Subjects were allocated to either group on the 


basis of case conference decisions or other clinical diagnostic procedures. The 
“‘Non-pathological”’ group comprised only cases where clinical diagnosis had been 
unable to establish the presence of any other factor than general mental inferiority. 
Broadly these men represented the familial, endogenous, sub-cultural or “garden- 
variety” defective. The ‘“‘pathological” group on the other hand was made up of 
defectives with signs of organic brain damage, psychosis, epilepsy or pronounced 
maladjustment. 


TaBLe 1. Means anp SDs or WEcHSLER IQs AND THE CORRELATION COEFFICIENTS AND PEs wItH 
Tue GoopENOUGH Score oF Two Groups oF CERTIFIED MENTAL DEFECTIVES 
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The correlation coefficients obtained by the two groups are listed in Table 1. 
It will be observed that the highest correlations were obtained with the non-verbal 
intelligence scale and that the “non-pathological”’ group shows consistently higher 
coefficients than the “pathological” group. Indeed the drawings of the pathological 
group were of much poorer quality (Mean Goodenough IQ 65, 8. D. + 17.67) than 
those of the non-pathological group (Mean Goodenough IQ 74, 8. D. +19.35) and 
the ¢ test indicated that the difference was statistically highly significant. 

Since the variables of sex, age and intelligence levels had been held constant 
it can be reasonably assumed that the lower correlations found in the pathological 
group were due to its characteristic make-up. It appears that the pathological 
factors tend to lower the correlation between drawing level and intelligence quotient 
and the drawing of a man can therefore reasonably be expected to give an indication 
of intellectual level in non-pathological subjects only. 


CHARACTERISTIC FEATURES IN DRAWINGS OF ‘‘PATHOLOGICAL”” DEFECTIVES 


A detailed study of the drawings of these two groups established the preponder- 
ance of certain features in the “pathological” group compared with the clinically 
“non-pathological” group. By inspection and reference to the literature on the art 
of the abnormal, a number of characteristic features was chosen for further investi- 
gation. After the analysis the following short list of “signs” was compiled, all of 
which taken as a whole, appeared relevant to the problem of a differential diagnosis 
between “‘pathology” and “non-pathology”. There were many other “odd” features 
in the drawings or mentioned in the literature but their incidence was too infrequent 
to permit generalisation. 

The 80 sets of drawings were scored for presence or absence of ‘‘drawing path- 
ology” using the following criteria: 


a. Use of unrealistic color. This point can not be scored if only 1 color is used since most 
low-grade defectives pick any colored pencil using it like an ordinary pencil without attention to 
color. If more than 1 color was used the point was scored even if “unrealistic color” was quite 
insignificant, e.g. red pupils in an otherwise “normal” drawing. 

b. Transparency. Some latitude is necessary in scoring this point. Minor lapses involving 
transparency (e.g. head seen through hat, body outline through an arm) were not scored. If 
however the artist had conceived body and clothing as two distinctive and comparatively un- 
related features of the “person”’ and signifying perhaps the covering properties of the dress only 
at a later stage through shading, “transparency” was scored. Circles for breasts were not in- 
cluded under this heading if there was no clear and detailed indication of dress. 


c. Peculiarity in Style. Some drawings showed little attempt at realism. They were mostly 
angular and patterned, elongated and systematically drawn, executed with sure control and no 
essential part of the figure was omitted. The finished drawing was neat, well spaced, large and 
firmly drawn, looking very weird and grotesque, nearly like a primitive idol from the South sea 
islands. Drawings which were merely primitive were not scored under this heading nor drawings 
which exhibited peculiarities only in parts. 

d. Lack of Motor Control. This point was scored in drawings which showed difficulties in 
effecting closures and in drawing straight lines. Sometimes the pencil appeared “to run away” 
and sometimes the artist attempted to overcome his shortcomings by an irregular scribbling re- 
inforcement of lines. “Sketchy” drawings were not included under this heading. 


e. Incomplete and/or disorganized drawings. ‘‘Incompleteness” was considered to be “path- 
ological” only if major parts or features had been omitted which, judging from the general level 
of the drawing, should have been included. This applied for example always to missing legs but 
not to missing arms in very low scoring drawings. The absence of arms or mouth has indeed more 
often than not “neurotic” significance as these “tension areas” reflect often the artist’s malad- 
justment. 

Disorganized drawings showed definite and marked displacement of the various body parts. 
A mere failure in judgment of proportion (e.g., arms attached too low) was not counted but if 
parts of the drawing though placed correctly, remained in isolation and without connection to 
each other, the point was scored. 

: . Shading. This point was credited when shading of any type covered at least approx- 
imately 25% of the body area. The quality of shading was not taken into account but in “Doubt- 
ol final score, light regular shading to indicate texture and dress appeared to suggest non-path- 
ology. 

. Crude Sex. This category referred only to male or female genitalia (not breasts). 
Special attention was paid to the often rather unrecognizable kinds of representation occurring 
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in the drawings of defectives. In many cases “crude sex” was drawn in response to the instruction 
“make it quite clear it isa man (woman)”. Symbolism was not counted nor any socially accepted 
portrayal of sexual characteristics. 


Besides these frankly pathological features it was noted that many of the low- 
scoring drawings of the pathological group contained elements which were rather 
characteristic of higher mental levels and therefore inconsistent with the scored 
Goodenough level. It rather suggested the presence of a higher intellectual potential, 
as for example established in the Wechsler test, which failed to express itself fully in 
the drawing. The following inconsistencies were selected for low-scoring drawings 
of “men”: 

a. Profile. Irrespective whether the whole person was drawn in the profile position, this 


point was scored when the head of any of the three persons was drawn in profile and the Good- 
enough score was 8 or less. 


b. Two or more colors. A colored drawing scoring 8 Goodenough points or less was usually 
only an outline drawn with one colored pencil instead of an ordinary one. The point was scored 
whenever more than one color was used and became diagnostically more significant the lower the 
score was. 


c. Color used for or Drawings scoring 8 or less Goodenou ugh points which used color for 
covering area (approx. 50%) were included under this heading. Color was usually used for the 
suit and received credit. Shoes or hat only was not considered. 


d. Sexual intention. If the Goodenough score was 2 points or less any attempt to different - 
iate sex by other characteristics than “long hair’ for the woman was checked under this heading. 
Drawings scoring 3 or 4 Goodenough points had to exhibit at least two other sex characteristics 
besides “hair”. In this particular instance the feature was scored even if the “‘sexual intention”’ 
did not proceed beyond a stage of hesitation and disturbance when sexual areas were approached 
in the drawing. The tester had to watch carefully for signs of uneasiness in the treatment of the 
genital area in both sexes. ‘Crude sex’’ was of course not scored under this particular heading. 
Usually the point was credited for characteristic shaping of the body outline to indicate waist 
and breasts, for depicting of the “flies” or, —the most common occurrence, —for “open treat- 
ment” which signified the failure to close the crotch area between the legs in either sex. 


These ‘‘inconsistencies” were extremely helpful in differentiating the genuine 
“primitive” drawings of the “garden-variety”’ defective from the productions of the 
“pathological” cases which scored often far below their Wechsler intelligence level. 
Color, for example, was used far more “‘intelligently”’ by the pathological group, 
20% of their low-se oring members used more than 1 color and 41% used color for 
area cover. The non-pathological group used more than 1 color in only 8% of the 
cases and used it for area in 11%. Similar findings were obtained regarding the 
other “inconsistencies” and “pathological features’ and have been discussed else- 
where“, 

The final score showed that the pathological group exhibited in their drawings 
67 “‘‘pathological features’ and ‘‘inconsistencies’’ but that the non-pathological 
group, despite being of the same sex and having the same age and intelligence range 
showed only 18 indicators of pathology. Statistical analysis indicated the possibility 
of establishing a “cut-off score” resulting in a threefold division (Table 2). 


TABLE 2. ScortIna OF THE DrAawinG TEstT ror PaTHoLoGy 








Criteria Diagnosis 





0 or 1 “inconsistency” 
Non-pathological 


and 
0 “pathological features” 





1 or 2 “inconsistencies” 
an Doubtful 
1 “pathological feature” 


or 
1 “pathological feature” by itself 





3 or 4 “inconsistencies” 
and/or Pathological 
2 or more “pathological features” 
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Scoring the 80 sets according to Table 2 showed a good correlation between the 
“diagnosis” of pathology or non-pathology on the basis of the drawings and the 
previously obtained clinical diagnosis of presence or absence of pathology. 74% of 
the cases were placed by the drawing criteria in the same category as by psychiatric 
opinion and only 12.5% disagreed with psychiatric diagnosis. 13.5% were consid- 
ered “doubtful” on the basis of the drawing criteria. This result is statistically 
significant at the 1% level of confidence. 

The correlation coefficients with the Wechsler non-verbal scale for the 80 
“men” of this sample were: Non-pathological drawings, N = 31, r = .82; patholo- 
gical drawings, N = 30,r = .41; “doubtful” drawings, N = 19,r = .73. The “‘doubt- 
ful’ drawings correlated in this sample rather well with the Wechsler scale and in- 
spection of the drawings indicated that most of them had a “non-pathological”’ 
character. It was later found that it is comparatively easy in clinical practice to 
place many of the “doubtful” drawings correctly in one or the other more definite 
category by observing the general character of the drawing. 

This investigation demonstrated that the scoring for “‘pathology” according to 
the criteria given in Table 2 may serve two purposes: 

1. The presence of pathological features and inconsistencies in the drawing 
directs attention to the probable existence of clinical pathology or maladjustment 
and to the possibility that the drawing level has been affected by it. The Gooden- 
ough method for estimating intelligence should not be used in these cases. 


2. The absence of pathological features and inconsistencies may indicate a 
corresponding absence of significant clinical pathology and valid results with the 
Goodenough method can be expected. 


Tue PREDICTION OF PROBABLE WECHSLER NON-VERBAL INTELLIGENCE 
LEVEL From Non-PATHOLOGICAL DRAWINGS 


The high correlation of .82 between Goodenough score and Wechsler non- 
verbal IQ obtained in the sample of 31 non-pathological drawings suggested the use 
of the drawing score for estimating non-verbal intelligence quotients and mental 
ages. A breakdown of the 344 cases according to age and sex and the presence or 
absence of pathology (Table 3) showed generally the same pattern as observed in 
the pilot investigation. The highest coefficients were found in the non-pathological 
drawings, the lowest in the pathological ones with the “doubtful” drawings usually 
having a coefficient between the two extremes. Nearly all of these correlations ob- 
tained in small and more homogenous groups were statistically significant but only 
the coefficient .73 relating to the non-pathological drawings was high enough for 
computing the regression equation. 


TABLE 3. CORRELATION COEFFICIENTS OF GOODENOUGH SCORES WITH WECHSLER 
Non-VERBAL INTELLIGENCE QUOTIENTS IN GROUPS OF VARYING COMPOSITION 








Non-pathological Doubtful Pathological 
Population drawings drawings drawings 
I r N r N r 





Men under 30 86 er 47 .65** .30* 
Women under 30 22 .58** 27 .46* .61°* 
Men over 30 9 16 Pe | .52°* 


Women over 30 14 _86** 6 — f 
Total 131 A ae 96 .46** .36** 


44 














Grand Total: N 344 , r= .54 
**indicates correlation coefficients at the 1% level of significance 
*indicates correlation coefficients at the 5% level of significance. 


It was found that the scores of the shortened Goodenough method forecast the 
Wechsler non-verbal 1Q with a standard error of estimate of +11.91. In other words 
the chances are approximately two in three that the predicted Wechsler non-verbal 
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IQ does not miss the actual IQ by more than 12 points. The standard error of estim- 
ate can be considered as a standard deviation and these 12 points do not compare 
unfavorably with the Wechsler non-verbal 8.D. of 14 points for the age group 17-29. 

Similarly a prediction of non-verbal MA can be made, using Wechsler’s ““Equiva- 
lent Mental Age” table which provides for a span from 7 to 15-16 years. The correla- 
tion coefficient here, r = .78, is somewhat higher in the same sample because the 
MA ratings appear to agree better with the drawing score in older people than the 
Wechsler IQ which expresses the individuals standing in relation to his age group. 
The standard error of estimate is + 18.7 which means that the chances are two in 
three that the forecast of Wechsler’s non-verbal MA will not be out by more than 
19 months. 

On the basis of the correlation coefficient found in a sample of 131 non-patho- 
logical drawings the following Table 4 was constructed which gives tentative Test 
Ages and Test Quotients of the Drawing Test and predicts the probable Mental 
Age and Intelligence Quotient of the Wechsler non-verbal scale. 

As the sample is heavily weighted in favor of young men the criteria and norms 
may not apply equally well to female defectives who find the task of drawing a 
“man” at times rather disturbing. Generally speaking it is advisable to apply the 

_T. A. ratings particularly to the older age group whilst the T. Q. and T. A. ratings 
are equally helpful in determining a younger defective’s non-verbal intelligence level. 
Care should be taken not to over or underestimate a defective’s level at the ends of 
the scale and clinical judgment must be applied in these cases more than usually. 
For instance, the two most frequently obtained credits at the lower end of the scale 
(hair, ears) often tend to inflate the T. Q. out of proportion to the otherwise very 
primitive character of the drawing, while at the top end of the scale quite competent 
drawings showing movement and good conception lose Goodenough credits on ac- 
count of sketchy execution where details may have been disregarded. It has been 
found useful in clinical practice to use the full Goodenough scale in these cases, as 
the abbreviated form is not sensitive enough at the “ends’’. 


TaBie 4. TENTATIVE Test QuoTIENTS AND Tgst AGES ON THE BASIS OF THE 
SHORTENED GOODENOUGH ScaLE BY BERDIB 
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N = 131 : Correlation with Wechsler Non-verbal IQ : r .73 
Standard Error of Estimate 11.91 


Correlation with Wechsler Non-verbal MA :r .78 
Standard Error of Estimate 18.7 


Discussion 


Pathological features and inconsistencies of different character than those 
described in this paper are found frequently in drawings by defectives. These iso- 
lated occurrences do not, however, appear to indicate anything else than that we 
are dealing with a subnormal population where we have to expect some deviation 
from the normal. Only when, in an individual case, there are several deviations from 
the norms derived from a comparable population, it suggests the existence of devia- 
tions in personality make-up as has been shown here and elsewhere“ 5). A pro- 
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cedure as outlined in the preceding paragraphs demonstrates this approach objective- 
ly but it is not suggested that ‘“‘scoring”’ should be used without discrimination. On 
the whole an experienced clinician will be able to give weight to the various strange 
features appearing in drawings though the presence of ‘‘inconsistencies” may pro- 
vide a useful guide in those cases where pathology is not conspicuous. 

In clinical practice it has been found that the most valuable suggestions were 
obtained by comparing the drawing with other test results. In cases of the “garden 
variety defective” the drawing level agreed well with other tests besides the Wech- 
sler. Marked discrepancies, however, between the drawing, whether pathological or 
non-pathological, and other tests directed attention to certain character malforma- 
tions, instability and emotional immaturity. Generally speaking, the drawing in- 
dicates intelligence level fairly well and where there are marked discrepancies the 
explanation should not be in terms of “lack of validity or reliability’? but should 
refer to individual personality aspects. “ 


SUMMARY 


This paper has attempted to demonstrate that under certain circumstances the 
drawing of a “man” predicts with reasonable accuracy the Wechsler non-verbal 
intelligence level of certified mental defectives. In course of various investigations 
the following major results were obtained: 

a. In a sample of 344 unselected institutionalized mental defectives the cor- 

relation coefficient between the Wechsler non-verbal IQ and the drawing score 
was .54. 

b. The drawing scores of a clinically diagnosed ‘‘pathological’” group of 40 
male defectives were compared with those of a clinically ‘“non-pathological”’ 
group. Though statistical analysis showed no significant differences in age and 
the three intelligence quotients of the Wechsler Test, an r .64 was found in the 
pathological and .79 in the non-pathological sample. 

ce. It was shown that the drawings of clinically pathological and non-patholog- 
ical defectives differed characteristically in certain features. The whole sample 
of 344 cases was therefore subdivided into three groups on the basis of specified 
features in the drawings. The correlation coefficients were as follows: 

Absence of “Pathology” in drawings -r = .73 
Presence of “Pathology” in drawings -r = .36 
“Doubtful” drawings -r = 46 

d. It was suggested that a shortened Goodenough scale as previously pub- 
lished by Berdie would indicate the non-verbal intelligence level of the Wech- 
sler test if scored from ‘‘non-pathological’”’ drawings. A table of tentative test 
ages and test quotients was presented. 

e. The presence of pathological features in the drawing suggests that it is un- 
likely to indicate the appropriate intelligence level on the basis of a Gooden- 
ough score and also points to the probable presence of clinical pathology of 
some kind, ranging from maladjustment to psychosis or organic brain damage. 
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THE RELATIONSHIP BETWEEN CERTAIN PICTURES OF THE 
ROSENZWEIG PICTURE-FRUSTRATION STUDY AND 
CORRESPONDING BEHAVIORAL SITUATIONS 
BENJAMIN MEHLMAN AND STEPHEN LEE WHITEMAN 

Kent State University 


INTRODUCTION 
The need to evaluate and examine projective methods is not often enough ser- 
iously or generally questioned. As Mursell“ states, ““Too many authors and stu- 
dents of projective techniques make outlandish claims and look the other way when 
psychometrics are discussed in relationship to their test.’’ In order for these tech- 
niques to become useful and meaningful to all who use them, their validity and 


reliability must be established. The primary purpose of the present study is to de- 
termine the similarity of subjects’ responses to certain pictures of the Rosenzweig 
Picture-Frustration Study to their responses to analogous behavioral situations. 

There have been a number of studies concerned with the relationship of pro- 
jective tests to induced stress. Among others, Eichler® and Williams“? each in- 
vestigated Rorschach signs of anxiety in regard to stress-produced anxiety, finding 
that stress and non-stressed groups could be distinguished on the basis of the Ror- 
schach. Calvin ®) found that the TAT failed to predict certain laboratory behavior. 
Yarrow ©) found that antecedent frustration tended to increase the aggression found 
in projective play. In falsely reporting test scores to college students, French“ 
found changes on the Rosenzweig P-F test. In other studies using the Rosenzweig 
P-F test, Albee“? found that the type of aggression expressed on this measure was 
related to the diagnosis and prognosis of institutionalized mental patients, and 
Getling“ found juvenile delinquents to be more extra-punitive than a non-delin- 
quent group. 


METHODOLOGY 


Two hundred sixty three male and female college students between the ages of 
17 and 28, in Introductory Psychology classes, were administered the Rosenzweig 
Picture-Frustration Study. Of the 263 students tested, only 189 could be used for 
the study. The remaining students didn’t wish to participate in the experiment, did 
not follow up the appointments, or were otherwise unsuitable. These 189 subjects 
were divided into three groups on a non-selective basis. Due to the failure of some 
of the students to appear for their appointments, an unequal number of subjects 
(68, 54, and 67) are to be found in the groups. 

Pictures 10, 12, and 13 of the P-F test were selected as the ones to be considered 
in this study because they seemed to be most readily approximated in a behavioral 
situation. Picture 10 of the P-F test depicts a man accusing another of being a liar. 
Picture 12 portrays a man pointing out to another that the latter’s hat has been 
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taken by someone else who has left his own instead. Picture 13 depicts a man at his 
desk who is stating that he can’t keep a previously arranged appointment with a 
caller just arriving. 

Behavioral situations were devised to correspond to the pictures as closely as 
possible in terms of the obstacles with which the subject was confronted. The situa- 
tions were designed so that the subject could theoretically respond to the frustration 
in a similar Direction of Aggression and with a similar Reaction Type as on the P-F 
test. Each of the 189 students who participated in the study was exposed to only 
one of the three behavioral situations. The behavioral situations were administered 
to only those students who had made previous appointments. 

Each behavioral situation was viewed simultaneously but independently by 
three judges through a one-way window. The judges scored the verbal responses and 
general behavior of the subjects according to Rosenzweig’s system. Prior to the 
actual experiment the judges participated in several practice sessions. During these 
sessions the judges were instructed as to Rosenzweig’s scoring method. Subsequent 
to the practice sessions, the scoring reliability of the three judges was obtained. 

Situation I, designed to correspond to Picture 10, involved the administration 
of the Wechsler-Bellevue Information subtest in written form. E excused himself 
during the administration, only to return and accuse S of having cheated from the 
key (conveniently left on the table). 

Situation II, designed to correspond to Picture 12, involved having § leave his 
books one place and taking a test elsewhere. Upon his return, he found a different 
and shabbier set of books in place of his own. 

Situation III, designed to correspond to Picture 13, involved informing a sub- 
ject, when he appeared for his appointment, that the appointment could not be kept. 

Each of the students involved in the study was administered the entire Rosen- 
zweig Picture-Frustration Study and one behavioral situation. The technique of 
counter-balancing administration was used to cancel contaminations, with the 
possibility of carry-over effects due to the proximity in time between the two admin- 
istrations minimized by instituting a week’s time between administrations. In order 
to control for the possibility of the subjects associating the P-F test with the be- 
havioral situation an examiner other than the frustrater, who conducted the be- 
havioral situation, was employed to administer the Rosenzweig Picture-Frustration 
Study to the classes involved in the study. 

Both the Picture-Frustration Study and the Behavioral Situations were scored 
according to Rosenzweig’s method. The scoring categories which Rosenzweig uses 
and as used in the present study consider the subjects’ responses in terms of (1) 
the Direction of Aggression and (2) the Reaction Type. All the P-F tests were scored 
by the authors. The P-F Test protocols were coded so that scoring might be com- 
pletely independent of biases incurred during the behavioral situation. Three judges 
viewed the behavioral situations and scored them independently, as the situations 
were in progress or immediately after. The observers were not visually accessible 
to the subjects. Prior to the administration of the test and behavioral situations, the 
judges were given 30 selected samples of responses taken directly from the P-F 
manual to score. The reliability of scoring between the judges was determined by 
the number of agreements that were obtained. A further check on the reliability of 
the judges was obtained by taking a random sample of nine of the subjects’ protocols 
from the 189 used in the study. 


RESULTS 


Reliability of Scoring. In the three checks made on the reliability of scoring (30 
responses prior to the experiment, 9 selected protocols from the experimental data, 
and the 189 Behavioral Situation responses) agreement was found in each case to be 
significantly better than chance at or below the 1% level of confidence. The scoring 
reliability found suggests that the results obtained in this study should not be greatly 
affected by discrepancies in scoring. 
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The Relationship Between P-F Pictures 10, 12, and 18 and Behavioral Situations 
1, 2, and 3. The hypotheses that there were no statistically significant differences 
between the frequencies of response to each of the pictures and to the corresponding 
behavioral situation, for Reaction Type and Direction of Aggression, were tested, 
and with two exceptions, could be safely rejected at or below the 1% level of con- 
fidence. There is here evidence that neither the response to the individual picture 
nor the behavioral situation could be securely predicted from a knowledge of the 
other. The Direction of Aggression in the case of Picture 10 would appear to be the 
one exception, while the results for the Reaction Type for the same picture are not 
conclusive. 


TABLE 1. VALUES FoR COMPARISON OF AGREEMENT BETWEEN ReEsPonsEs GIVEN TO P-F Tgst 
PiIcTURES AND CORRESPONDING BEHAVIORAL SET AND DoMmINANT P-F RESPONSE. 








Behavioral Situations Dominant P-F Response 





Direction of Reaction Direction of Reaction 
P-F Picture Test Aggression Type Aggression Type 


10 .12 6.90 7.82 17.20 
12 19.10 35.17 25.07 60.40 
13 18.88 44.76 11.20 34.09 




















*These values are based on a 2 x 3 table where with 2 d.f. a X? of 
13.82 is significant at or below the .001 level of confidence. 


The Relationship Between Pictures 10, 12, and 13 and the Dominant P-F Scoring 
Categories. The hypothesis that there were no statistically significant differences 
between the frequencies of response to each of the pictures and to the Dominant 
P-F Scoring Categories, for Reaction Type and Direction of Aggression, could each 
be safely rejected at or below the 2% level of confidence. (See Table 1) 


The Relationship Between Behavioral Situations 1, 2, and 3, and the Dominant 
P-F Responses. The hypotheses that there were no statistically significant differences 
between the frequencies of response to each of the behavioral situations and to the 
Dominant P-F responses, for Direction of Aggression and Reaction Type, were 
tested, and with two exceptions, could be safely rejected at or below the 1° level of 
confidence. (See Table 2) The Direction »f Aggression for the Dominant P-F Res- 
ponse and Behavior Situation 2 could not be rejected, and for Behavior Situation 1, 
the results are_inconclusive. 


TABLE 2. VALUES FOR COMPARISON OF AGREEMENT BETWEEN 
RESPONSES GIVEN DURING THE BEHAVIORAL SITUATIONS AND THE 
Dominant P-F Responsss.* 








Behavioral Situation | Direction of Aggression | Reaction Type 


1 6.25 25.08 
2 67 75.46 
3 37.17 85.8 








*These values are based on a 2 x 3 table where with 2 d.f. a X? of 
13.82 is significant at or below the .001 level of confidence. 


Distribution of Responses. In Table 3 are found the number and percentages of 
subjects in each of the scoring categories. A heavy loading of responses into only 
one of the possible categories is consistently found. Table 4 shows the Chi Square 
values for the comparison of the expected and observed distribution of subjects, 


1The Dominant P-F Score was defined as the most frequently occurring category. 
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assuming an equal distribution of expected responses. Save for Direction of Ag. 
ession of Picture 12 (significant at the 2% level) the null hypothesis could be re- 
jected in each case at or below the 1% level of confidence. 


TaB.Le 3. DistrreuTion oF Sussects IN EAcu OF THE ScorING CATEGORIES 











Scoring Categories 

E M 0-D 
P-F Pictures No. | % No.| % | No. | % 
10 61 3 | 45| 2] 2.9 
12 11 | 20.4 29.6; 1 | 2.0 
13 | 38 | 56.7 
Behavioral 


Situations } 
| 60 | 88.2 | 5 4.4 | 16.2| 57 | 83.8 


2 | 30 | 55.6 29.6 | 57.4| 2 | 3.7 
3 | 14 | 20.9] 6.0 73.1 | 71.6| 3 | 4.5) 


| 


















































Dominant P-F | | 
Response 119 | 63.0} 38 | 20.1) 32 | 16.9) 1.1) 149 | 78.8 














| 











TaBie 4. VALUES FOR COMPARISON OF SussEcT DISTRIBUTION IN 
Eacu OF THE ScorING CATEGORIES* 








P-F Pictures Direction of Aggression | Reaction Type 


10 97.21 124.17 
12 7.44** 65.22 
13 26.37 








Behavioral Situations 

1 92.19 
2 13.78 
3 50.08 


Dominant P-F 
Response 74.95 186.38 














*These values based on a 1 x 3 table where with 3 degrees of free- 
dom a X? of 9.210 is significant at the 1% level of confidence. 
** Approaches the 2% level of confidence. 


DIscussIoNn 


Our results have been consistently disappointing. In at least the sample used, 
and under the conditions specified, we have demonstrated no evidence, with one 
exception, that one might successfully predict overt behavior from a knowledge of 
P-F performance. Further, evidence indicates that one may not safely predict from 
a knowledge of performance on any of the pictures utilized to Dominant Direction 
of Aggression or Dominant Reaction Type, or vice versa. In fact, with a single ex- 
ception, the evidence here indicates that one may not successfully predict from a 
knowledge of Dominant Response to any of the Behavioral Situations. Still further, 
a factor which may in large part underlie the failure cited above has been uncovered; 
i.e., a heavy and statistically significant loading of responses such that they tend to 
fall into only one of the several theoretical possibilities afforded. At least for the 
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sample here tested, it would appear that the pictures (and corresponding behavioral 
situations) do not elicit a sufficiently wide latitude of responses. 

One might argue, with some validity, that results on a projective test need not, 
and in fact should not, directly correspond to overt behavior. One cannot argue, 
however, that there should be no agreement. Further, since one of the major reasons 
for using a projective test is that such an instrument will reveal the personality in 
all its uniqueness, it follows that when results with such an instrument result in a 
homogeneity of responses rather than the heterogeneity we would expect, that the 
instrument is wanting, at least for the population used. Given the facts that only 
three of 24 pictures were used, that a college sample was used, that the academic 
setting may have influenced the responses, that extrapolation from an experimental 
to a real-life situation has pitfalls, and other circumstances peculiar to this particular 
study, it would follow that the Rosenzweig P-F Study is found wanting. 


SUMMARY 


The purpose of the present study was to determine the accuracy of the Rosen- 
zweig Picture-Frustration Study as a predictor of overt behavior. Three behavioral 
situations were devised to correspond with Pictures 10, 12, and 13 of the P-F Test. 
The relationships between responses to the individual pictures and corresponding 
behavioral situations, and each of these and the Dominant P-F response were almost 
uniformly such as could be secured on a chance basis alone. It was also found that 
there is a heavy loading of responses into only one of the possible scoring categories. 
The data indicate, considering that the sample was drawn from a college population 
and only three of twenty-four possible P-F pictures were used, among other circum- 
stances peculiar to this study, that one cannot accurately predict overt behavior 
from a prior knowledge of an individual’s test performance on the individual pictures 
or on the Rosenzweig Picture-Frustration Study as a whole. 
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A BRIEF SCREENING SCALE FOR PSYCHOPATHOLOGICAL PATIENTS 
DEVELOPED FROM MMPI SCORE-PATTERNING! 


JOEL M. CANTOR 
Los Angeles County General Hospital 


A major problem for many general hospitals is identifying those patients on 
admission who are in as much need for psychiatric treatment as they are for medical 
and surgical procedures. This article presents a scale of items which was developed 
to identify such patients with a minimum of error in just such settings. Its efficiency 
may originate in the unique manner in which the items were derived. The criterion 
group was obtained by searching for a specific pattern of scores on the MMPI. 


METHOD 


The means and standard deviations for the ten MMPI clinical scales of a sample 
of 250 male psychiatric patients were obtained from Rosen“). This sample repre- 
sented a random selection from MMPI s which had been administered to all testable 
male patients on the Psychiatry Service of the VA Hospital, Ft. Snelling, Minn. 
Each of the 250 MMPI’s was sorted to obtain those MMPI’s whose Hs score came 
within plus and minus one standard deviation of the mean Hs score of the sample. 
The standard deviation used was that of the sample. Those MMPI’s which were 
obtained were then sorted to find those whose D score fell within plus and minus one 
standard deviation of the mean D score. This procedure was carried out until only 
those MMPI’s remained (52 in number) which best fitted the mean curve of the 
sample in that all their scores were within one standard deviation of the mean scores 
of the respective scales. 

Counts were made to obtain the percentages of true and false responses to the 
550 MMPI items by the patients in the group of 52. This group will be referred to 
as the criterion group. These percentages were compared to those given in the 
normative tables for the MMPI“. The specific group used in the comparison (‘‘V”’ 
in the tables) represented the answers of 139 normal males.?, Those items which 
showed a difference of 10% or more were tested in fourfold contingency tables. 
Those items which showed a difference with a probability less than .01 were com- 
pared to another group of normals (“‘XI”’) in the tables. This group consisted of 152 
= males. Here again no percentages were available for ltems I-41 through 

-40. 

Although this method is not a cross-validation because the criterion group was 
not split for the two comparisons, the level of significance held to and the efficiency 
of the scale of items which resulted suggest that the greater part of those items pre- 
sent in the scale by chance were kept to a minimum. 

The first comparison resulted in 31 items occurring at the .01 level of signifi- 
cance or less. The second comparison had 25 remaining at that level. These items, 
the percentages of their occurrence and the significance levels of the differences will 
be found in Table 1. Chi-square was the statistical procedure used. Item J-4 could 
not be subjected to two comparisons because just the one percentage was available 
in the normative tables. However, the item was retained because of the extremely 
low level of significance at which it differed from the one percentage available and 
because of its face validity. 

It will be noted that all of the MMPI items in Table 1 have been rewritten. 
This was necessary so that the items could be put into mimeograph form and thereby 
more conveniently administered to the patients admitted at the Sioux Falls VA 


1This study was conducted at the VA Center, Sioux Falls, S. D. while the author was a member 
of the staff. Appreciation is expressed for the assistance given this study by the Chief Medical Officer, 
Dr. R. D. Green, and the Registrar, Mr. V. H. Vander Schoor. 
— for Items I-41 through J-40 where percentages are available only for 54 VA Hospital 
males. 





A BRIEF SCREENING SCALE FOR PSYCHOPATHOLOGICAL PATIENTS 21 


1. MMPI Irem Nomsers, THE REwRITTEN Form, THE “PLUS” PERCENTAGES, AND THE 
PROBABILITY OF THOSE ITEMS WHICH SHOW SIGNIFICANT DIFFERENCES 








“Plus” P 

Percent- with 
Rewritten Form ages Group 

N = 52 V 





Do you tire quickly? 
Do you think you are more nervous than most others? 
Have you ever fainted? 
Is your judgement now worse than it ever was? 
Do you wake up most mornings feeling stale and tired? 
Is your sleep fitful and disturbed 
Do you go to sleep most nights with thoughts or ideas 
bothering you? 
Have you at times very much wanted to leave home? 
Would. you attend a sexy show? 
Do you believe in miracles? 
Do you enjoy gambling for small stakes? 
Do you wish you could be as happy as others seem to be? 
Have you had periods of days, weeks, or months when you 
could not take care of things because you could not 
“get going’’? 
Do wor! frequently find yourself worrying about some- 
thing 
Do you often have spells of the blues? 
Does it make you nervous to have to wait? 
Do you have periods of such great restlessness that you 
cannot sit long in a chair? 
Do you sometimes become so excited that you find it hard 
to get to sleep? 
Have you at times been worried beyond reason over some- 
thing that really did not matter? 77 
Are your hardest battles with yourself? 79 
Have you sometimes felt that difficulties were piling up so 
high that you could not overcome them? 75 
Are you inclined to take things hard? 75 
Do you work under a great deal of tension? 63 
Do you certainly feel useless at times? 75 
Do you sometimes feel that you are about to go to pieces? 73 





Hospital. Rewriting each item changed it from the declarative statement of the 
original to a direct question. Some of the items were shortened considerably and 
others restated positively rather than negatively. It is doubtful that these modifica- 
tions altered the meanings of the items to the point where the meanings were basical- 
ly changed. It will be shown that such rewriting does not significantly change the 
number of plus responses obtainable from a group of medical patients who had taken 
the set of items both in the original and in the rewritten forms. 


RESULTS 


The 25 items were put into random order and administered to a group of 100 
surgical patients. The mean and standard deviation of their “yes” scores will be 
found in Table 2 with those for other groups. 


TaBLE 2. Means, STANDARD DEVIATIONS, AND SAMPLE Si1zBs AVAILABLE FOR 
Various Groups 








Groups N 





Surgical Patients 100 
Medical Patients 80 
Psychiatric Patients 55 
VA Employees (male) 30 
College Students (male) 18 
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The critical ratio of the difference in means for the surgical and psychiatric 
groups is 12.76, which has a chance probability of less than .001. 

Reliability of the scale was estimated by comparing the scores made on the two 
forms of the set of items by a group of 47 medical patients. In every case, both forms 
were taken by the patient on the same day or on one day apart. The means and 
standard deviations for the “‘yes’” scores were: for the rewritten items, mean 8.45 
and s. d. 4.54, for the items in their original form, mean 10.15 and s. d. 4.25. The 
correlation coefficient was .74 with confidence limits set at .51 and .87 by Fisher’s 
r to z transformation ®). The slight difference in means (although not significant, the 
critical ratio of their difference being .54) suggests that the true value of r may have 
been attenuated by estimating it from the two different presentations of the items. 
The fact that the critical ratio was not significant would also suggest that the basic 
meanings of the items had not been changed through their having been rewritten. 
Table 3 presents the percentages of “‘yes’”’ scores for groups of psychiatric, medical 
and surgical patients. 


TaBLe 3. PeRcENTAGES or “YxES” Scores TO THE SCALE OF ITEMS BY 
PsYcHIATRIC, MEDICAL AND SurGIcAL Groups 








“Yes” Psychiatric Medical Surgical 
Score (N 55) (N 80) (N 100) 
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Note: 3% of Surgical Patients had zero scores 


Discussion 

Using the sample statistics of the surgical group as normally distributed, a 
cutting score of 16 “yes” responses (which occurs approximately two standard de- 
viations above the mean of the surgical group) should identify about 50% of the 
psychiatric patients at the risk of misidentifying as psychiatric about 2% of the 
surgical patients. This would represent a difference of about 48%. However, Table 
3 shows that the percentage difference for a cutting score of 16 is closer to 60%. This 
increase in efficiency may be due to the score-pattern approach in finding the criterion 

roup. 

, The question could be asked whether the 4% of the surgical] patients mistakenly 
picked out as in need of psychiatric treatment are actually mistakes in prediction. 
The names of the four patients involved were obtained and their medical histories 
reviewed. The following data was found: 
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Patient A. Admitted with and treated for a spastic adduction deformity of the right foot. 
Patient B. Admitted with and treated for an indirect left inguinal hernia. 


Patient C. Admitted with low back pain. No physical basis for the pain was found on 
x-rays. Treatment was by novocaine nerve block. 


Patient D. Admitted with and treated for adenomatous rectal polyp. Also carried service- 
connected disability grant for Psychoneurosis, Anxiety. 


Although all four patients may have had psychiatric complaints which might 
have been verified by closer examination, there is certainly little doubt in the case 
of Patient D in view of his disability rating. There is also a probability of a neurotic 
syndrome in Patient C since no physical basis was established for his low back pain. 
If these two patients could be considered as correct predictions by the scale, then the 
percentage of misidentification would be lowered. 

Does the scale of items identify for treatment all the psychiatric syndromes or 
is it selective for just certain ones? For example, examination of the item content 
shows a multitude of tension and distress complaints which is borne out also by the 
fact that the number of items overlapping with the standard MMPI clinical scales is 
highest for the Pt scale. 

This type of inefficiency was investigated by comparing the incidence of major 
diagnostic groupings for those patients in the psychiatric group with ‘“‘yes’’ scores of 
16 or more, with the incidence of major diagnostic groupings on the psychiatry ser- 
vice over a period of one and one-half years (January 1953 to June 1954). The 
sample size of the psychiatric group was increased to 45 for this phase of the 
analysis. Table 4 summarizes the percentages obtained. 


TABLE 4. PERCENTAGES OF INCIDENCE FoR Masor Diacnostic Groups 








Diagnostic Psychiatry Service Patients with Percentage 
Groups Jan. 1953 - June 1954 “Yes” Scores Difference 
16 or More 





Psychoses 31 (16%) 22%) 

Psychoneuroses 108 75683 
Anxiety 
Conversion 
Hypochondriacal 
Depressive 
Mixed 
Other 

Conduct Disorder 

Alcoholism 

Total 
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Chi-square: 7.488 
Degrees of freedom: 3 
Probability: between .10 and .05 





Although the chi-square of the difference in proportions was not significant, 
since its probability level was less than .10, some consideration should be given to it. 
The lowered incidence of psychoneurotics found in the “screened” group (16% lower) 
suggests that specific neurotic syndromes may have been overlooked by the ‘ecale. 

This possibility was examined by a further breakdown of the psychoneurotic 
diagnostic grouping into specific syndromes. Table 4 presents the figures. It is ap- 
parent that the groups available are too small to permit statistical analysis. This 
phase of analysis must await larger samples of patients for which both the scale 
scores and diagnoses are available. The scores of patients carrying diagnoses of 
conversion reaction or psychophysiological reaction should probably be closely in- 
vestigated because one of their clinical features is the denial of tension and distress. 
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SUMMARY 


A criterion group of patients was established by finding those patients who were 
more typical of the total patient group in that their MMPI scores were within one 
standard deviation of the mean profile of the total group. The “total’’ group was a 
random sample of patients from the patient population of a VA male psychiatry 
service. A scale of 25 items was obtained by comparing the incidence of answers of 
the criterion group to the incidence of answers found for two different normative 
groups. The scale seems to predict which patients would be in need of psychiatric 
treatment with a small percentage of misidentification. Rewriting MMPI items for 
the scale does not seem to alter their validity. 
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A QUANTITATIVE METHOD OF SCORING PICTURE STORY TESTS' 
CARL N. ZIMET AND HAROLD J. FINE 
Veterans Administration, Palo Alto, California 


PROBLEM 


Since publication of the TAT by Morgan and Murray in 1935, the test has been 
widely used in psychological practice. The approaches to interpretation have differed 
widely, ranging from the highly structured need-press system of Murray and his 
co-workers to the rather haphazard approach of clinical intuition and empirical 
evidence. The method of TAT analysis designed by Murray“? is satisfactory to 
only a limited group of researchers. Major criticisms revolve around its basic theory 
of personality structure which as yet is not commonly accepted. Thus, any TAT 
or picture-story researcher who does not accept that theoretical framework finds it 
difficult to adopt the method of analysis. Another criticism is directed at the nomen- 
clature which the authors devised and which is complicated and confusing. It is also 
extremely difficult to analyze data arrived at by this method statistically. We have 
attempted, in this study, to devise a method of analysis which could be more widely 
accepted in theory and which would yield data that could be statistically manipu- 
lated. 

While such previous TAT workers as Aron“, Sanford“), Masserman and 
Balken “ have used psychiatric and psychoanalytic constructs as the basis for scor- 
ing and interpreting the records, this study is based primarily on the phenomenolo- 
gical frame of reference. In the phenomenological approach, behavior is seen as a 
function of the individual’s perception of self and the world about him. On this 
basis, then, key questions for understanding behavior are: ‘“‘How does the individual 
feel about himself and others, particularly other adults, and children?” This study 
is the report of a method of measuring change in attitude by use of a picture story 
test. It was utilized in a research project in which a group of school administrators 
engaged in a group therapeutic experience ®?. 


‘This paper is based on the doctoral dissertation of the senior author at Syracuse University. The 
authors wish to express their grateful acknowledgement to Dr. Arthur Combs for his help and guidance. 
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DESCRIPTION OF THE TEST 


A total of 18 stimulus picture cards were used. Six of the cards were from the 
original TAT, six were from a set of cards used in industrial research, and six were 
specifically drawn up to depict school situations. Although the procedure and find- 
ings are based on this unique combination of pictures, this same method of analysis 
could be used with any type of picture-story test. Since no ready instrument was 
available to quantitatively rate the variables of attitude towards self, adults and 
— from a picture-story protocol, six five-point rating scales were devised for 
each. 


I. Attitude towards self: In this category we are interested in how the subject telling the story 
looks upon himself. In order to assess several of the facets of this attitude, six different five-point 
scales were constructed. 


A. Sees self as adequate to inadequate. 

Score 1. Sees self as quite adequate to deal with the problems which he is confronted 
with. Approaches problems and their solutions with confidence and has belief in his own 
ability to handle them efficiently and effectively. 

Score 2.. Considers self as fairly adequate to deal with problems; is more or less con- 
fident of own ability. 

Score 3. Sees self as not very adequate and does not have too much confidence in his 
ability to deal with problems. 

Score 4. Feels himself to be more or less inadequate and most of the time doubts own 
ability to handle problems effectively. 

Score 5. Sees himself as quite inadequate to deal with problems. Approaches them fear- 
fully and with doubt about ability to effectively overcome them. 

B. Sees self as liked to disliked. 

Score 1. Sees self as well liked by other people. Feels confident of the good feelings of 
others towards him. 

Score 2. Regards self as fairly well liked by others. 

Score 3. Is doubtful of attitudes or liking of others for him. 

Score 4. Sees self as not particularly liked by others. 

Score 5. Sees self as disliked by others. 

C. Sees self as acceptable to unacceptable. 

Score 1. Feels accepted by other people, natural and easy member of the group. Feeling 
of belonging and ease in the group. 

Score 2. Feels fairly well accepted by other people. 

Score 3. Quite unsure regarding his acceptance by others and his group membership. 

Score 4. Feels himself to be not particularly well accepted by others. 

Score 5. Considers self to be quite unaccepted by others; is threatened by the group 
structure; an outgroup member. 


D. Sees self as democratic to authoritarian. 


Score 1. Sees self to be keenly understanding, accepting and fair. 

Score 2. Feels himself to be more or less understanding and accepting of others. 

Score 3. Considers self to be somewhat understanding and accepting of others. 

Score 4. Fairly strong emphasis on his authority position; sees self as not particularly 
understanding and accepting. 

Score 5. Relies heavily on his authority to handle situations in high handed manner. 


E. Sees self as equal to others to inferior to others. 


Score 1. Sees self as free and easy, equal to other people. Behaves without doubt as to 
his acceptability and reaction from others. Sees others as capable and effective. 

Score 2. Feels self to be somewhat less equal to others, slight feeling of inferiority. 

Score 3. Not at all certain of his relationship with other people; feels himself to be some- 
what inferior. 
‘ Score 4. Regards self as fairly incompetent and unworthy; fairly strong feelings of in- 
eriority. 

Score 5. Considers self as very much inferior, feels quite degraded and unable. 


F. Sees self as equal to others to superior to others. 


Score 1. Sees self as free and easy, equal to other people. Behaves without doubt as to 
his acceptability and reaction from others. Sees others as capable and effective. 

Score 2. Feels self to be slightly superior to others. 

Score 3. Not at all certain of his relationships with other people; feels self to be some- 
what superior. 

Score 4. Regards self as rather superior, feels others are not very competent or worthy. 

Score 5. Considers others as very much inferior to himself, feels self to be very much 
more competent, worthy, etc. 
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II. Attitude towards other Adults: In this section we are interested in how the subject feels 
towards other adults. The following variables were devised: 


A. Sees other adulis as acceptable to unacceptable. 


Score 1. Regards other people as acceptable; sees them as persons of dignity and in- 
tegrity. Sees them as equal to self and treats them with sympathy and understanding. 

Score 2. Regards others as more or less acceptable, sees them as people with some 
dignity and worth. 

Score 3. Sees others as to some extent acceptable and equal with self. 

Score 4. Regards others as generally inacceptable and inferior. 

Score 5. Regards others as inacceptable, inferior, without integrity and competence, 
rejects them for one reason or another. 


B. Sees other adults as liking them to disliking them. 


Score 1. Is genuinely fond of other people. Sees them as people with integrity and 
dignity and shows real evidence of sympathy and understanding for them. Enjoys their 
company. 

Score 2. Likes others fairly well; shows some sympathy and understanding. 

Score 3. Unsure about others, likes people to some extent; shows a little sympathy and 
understanding. 

Seore 4. Rather dislikes others; considers them of somewhat lesser value. 

Score 5. Sees others as threatening and avoids contact; is fearful and suspicious of them. 


C. Sees other adults as competent to incompetent. 


Score 1. Sees others as competent and able to deal with the situations they confront. 
Perceives and appreciates the capabilities which others possess. 

Score 2. Sees others as competent to some extent, able to deal fairly adequately with 
situations confronting them. 

Score 3. Regards others as not very competent; +. “ye their capabilities very little. 

Score 4. Sees others as rather incompetent with tasks. 

Score 5. Sees others as quite incompetent, untrustworthy and unable to deal effectively 
with situations in which they are engaged. 

D. Sees other adults as friendly to unfriendly. 

Score 1. Sees others as sincerely friendly, helpful and of praiseworthy motivation. Sees 
villains as victims of circumstances and personal difficulties. 

Score 2. Sees others as fairly friendly, helpful and of fairly decent motivation. 

Score 3. Regards others as friendly and helpful to some extent. 

Score 4. Sees others as not friendly, but somewhat threatening. 

Score 5. Regards others as unfriendly, treacherous, untrustworthy and threatening to 
self and ends. 


E. Sees other adults as responsible to irresponsible. 


Score 1. Regards others as responsible, mature and capable for their station; sees others 
as — of making up own minds and arriving at own decisions; require little direction. 

Score 2. Regards others as fairly mature and responsible, more or less able to arrive 
at their own decisions; require little direction. 

Score 3. Considers others as not very mature and somewhat irresponsible, requiring 
some direction and supervision. 

Score 4. Sees others as fairly immature and not very trustworthy, requiring a good deal 
of supervision. 

Score 5. Others are regarded as irresponsible, erratic and untrustworthy, requiring con- 
tinual direction, supervision and coercion. 


F. Sees others as trustworthy and sincere to untrustworthy. 
Score 1. Sees others as sincere and trustworthy. 
Score 2. Considers others as more or less sincere and to be trusted, but not always so. 
Score 3. Sees others as not particularly sincere and not to be particularly trusted. 
Score 4. Regards motivation of others to be open to question. 
Score 5. Sees other people as insincere and untrustworthy. 


III. Attitude towards Children: Here it was important to describe quantitatively how the story 
teller looks upon children. The six five-point scales making up this attitude category are identi- 
cal with those used in the attitude towards adults scales, except for the substitution of the 
word “children” for “others”. 


Thus there are 18 individual five-point rating scales to measure the way an 
individual sees himself and the world. Through these individual scales we can get 
at segmental aspects of his attitudes, such as whether he sees other adults as com- 
petent or incompetent. On the other hand, by using broader groupings we can find 
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out how he feels towards himself, other adults, and children. When these 18 scales 
are combined, it is possible to arrive at a reliable reflection of the subjects’ overall 
attitude constellation. A comparison of pre-test to post-test scores can serve as an 
indication of how an individual or a group has changed through the introduction of 
such a variable as psychotherapy. 


CONGRUENCE OF AGREEMENT AND TEsT RELIABILITY 


In using a new system of scoring a picture-story test, it is fundamental that the 
examiner’s ratings be reliable. Due to the rather loose and often misleading applica- 
tion of the term “reliability” it is advisable not to use it. McFarlane“ in a dis- 
cussion of projective measurements has suggested the use of the term ‘‘congruent” 
as a substitute when dealing with projective data or, for that matter, any inter- 
pretational material. The term is not only more descriptive, but also seems to be a 
more accurate one. Webster defines ‘‘congruent”’ as: “possessing agreement between 
interpretation of judge and investigator’. Combs®? in discussing this problem 
raised the following questions: 1. To what extent can persons other than the re- 
searcher arrive at the same results in the analysis of the stories? 2. To what extent 
does a repeated analysis by the same researcher, after a passage of time, give the 
same results? 

Before attempting to check on the external congruity of these analyses, the two 
judges used in this experiment were given a period of training in the technique of 
analysis for the various scales. Each judge was an advanced graduate student in 
psychology. They were given four protocols, each of which was chosen at random 
from a file of 28 picture story test transcripts. As shown in Table 1, the percentage 
of agreement of each of the judges with the investigator as the criterion ranged from 
53% to 86%. Out of the 1296 scale items on the four protocols used for this check, 
the over-all mean agreement on the combined attitude scales for both judges was 
69.56%. 


TABLE 1. Per Cent AGREEMENT OF TRAINED JUDGES WITH AUTHOR’S 
ANALYSIS AS THE CRITERION FOR 2 PST Protoco.s 








PST Attitude Towards Mean 
Protocols Self Adult Child Seales 


Judge’ I 
Protocol a 75.18 3.94 74.12 71.08 
Protocol b 74.07 72.22 86.11 77.80 


Judge IT 
Protocol a 53. 57. 86.11 65.77 
Protocol b 52. 56. 81.48 63.58 











Mean . 9 62.50 81.96 69.56 








As a further check on the degree of agreement, tetrachoric correlations were 
computed on the three major scales for the two judges with the senior author as the 
criterion. For judge I the r, was .93 and for judge II the r, was .87, both significant 
beyond the .01 level of confidence. 

Two months after the original analysis was completed, two protocols were 
drawn at random and re-scored by the investigator. The results of this re-analysis 
appear in Table 2. In comparing the data of this table with those of Table 1, it will 
be found that the self or internal agreement is appreciably greater than the external 
agreement. This may be due to some of the following factors: 1. Greater familiarity 
with all the scales. 2. Greater consistency within one personality. 3. The likelihood 
of developing a certain manner of responding to the protocol material. The relia- 
bility, or as it has been termed here, the congruence of agreement both externally 
and internally seems to be high enough to justify this method of analysis. 
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TaBLe 2. PeR ceENT AGREEMENT OF AUTHOR WITH SELF ON SAMB 
'T PRoTroco.s AFTER A PERIOD oF 2 MonTHS 








PST Attitude Towards 
Protocols Self Adult Child Mean 


Protocol a 86.88 83.07 94.37 88.11 
Protocol b 84.34 88.52 89.12 87.33 
Mean 85.61 85.80 91.74 87.72 











SUMMARY AND CONCLUSIONS 


A procedure based on the phenomenological frame of reference was devised to 
quantitatively score and statistically manipulate story test protocols. The scoring 
areas that were used were three primary aspects of attitude: towards self, other 
adults, and children. With the exception of the latter, each of the three attitude 
scales was made up of six subscales each tapping a different aspect of the particular 
attitude pattern. Thus, there are a total of 18 scales, each based on a five point 
continuum by which a record is scored. The reliability of scoring on these scales has 
already been demonstrated in this study and a beginning has been made in attempt- 
ing to establish its validity by correlating the findings from the picture story test 
with Fine’s Verbal Participation Rating Scale®?. 

The scale described herein was utilized in a group therapeutic situation to 
measure attitude and motivational change. A picture story test was administered 
at the beginning and end of 16 four hour sessions to the group members. When com- 
paring the pre- and post- picture story test scores, statistically significant positive 
changes were noted on all of the major scales ®?. 

It seems that such a scoring procedure based on the phenomenological approach 
could be modified to investigate attitudinal structures in a variety of situations. With 


a suitable choice of stimulus cards and scoring categories, it would be possible to 
measure attitudes towards management and labor, minority groups, etc. Further- 
more, changes in attitudes resulting from psychotherapy or social action techniques 
could be determined by this method. 
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DOLL PLAY PHANTASIES OF NEGRO AND WHITE 
PRIMARY SCHOOL CHILDREN 


THOMAS F. GRAHAM 
Massillon State Hospital, Massillon, Ohio 


PROBLEM 


A survey of research comparing racial groups reveals the necessity for more re- 
liable techniques. The majority of previous investigations dealt with interracial 
problems involving variables too numerous for adequate control. °: *) In view of the 
fact that the methodology for interracial studies can not be properly designed and 
validated until further data is obtained, this study investigates intraracial factors. 
The objectives of the present research are as follows: (a) To design and standardize 
a projective doll play technique for studying the family phantasies of Negro and 
white primary school children. (b) To isolate if possible, psychological factors which 
may contribute to the development of children’s phantasies. 

Specifically, the hypotheses which were formulated and tested are as follows: 
(a) Quantitative differences exist between the phantasies of Negro primary school 
children because of their sex, development, and individuation. (b) Quantitative 
differences exist between the phantasies of white primary school children because of 
their sex, development, and individuation. 


METHOD 


The methodology employed two projective doll play techniques in similar 
sequences and forms independently among racial groups. The projective doll play 
method previously has been used by Baruch,®? Bach,“ Phillips,“ Pintler,™ 
Robinson, ®? and Sears,“ but none of these investigators applied the method to 


study intraracial factors. 

The doll play experiment began with seventy-four children ranging in age from 
73 to 102 months. In this group ten subjects were used to determine the reliability 
and various other methodological problems. However, the final analyses were based 
upon the phantasies of thirty Negro children and thirty white children. Because of 
significant sex differences previously found by Pintler, Phillips and Sears, “) further 
subdivision was made with respect to the sex to decrease the variability im these 
differences. 

One group of the children attended Edmund A. Jones School of Massillon, Ohio, 
while the larger group came from Benjamin Franklin School also of Massillon.* 
Both schools were located in the same section of the city and enrolled Negro and 
white children. An attempt was then made to determine how much effect sex, de- 
velopmental factors and individual differences might have had on the data collected. 
Based upon these three variables, homogenous groups were selected for each racial 
group and intraracial comparisons were made. 

The doll play theme used was that of a home. After the child is oriented to the 
stage set of the home, the examiner presents him with a family of 4 dolls depicting 
the subject’s racial group. At the end of 10 minutés the examiner discontinues re- 
cording and then introduces the subject to another doll family of four represent- 
ing, however, the opposite racial group. Following another 10 minutes of recording 
with the second family, the examiner presents both families of dolls together for a 
final 10 minute period of recording. 


RELIABILITY Stupy 
The reliability of the projective doll play methods was established by two ob- 
servers working at the same time, but independently. Information concerning re- 


*Grateful acknowledgement is made to L. J. Smith, Superintendent of Massillon Schools, who 
permitted the study of children attending classes there. 
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liability was determined by five major categories, namely: (a) frequency of stereo- 
typed actions, i.e. those dramatizations which in content simulate habitual routine 
actions and experiences which could be ordinarily expected to be performed by real 
persons in an analagous, actual setting, (b) frequencies of nonstereotyped actions, 
i. e. those dramatizations which in content simulate unexpected relationship of in- 
dividuals or groups, (c) frequencies of group action, i. e. whenever three or more dolls 
are projected as doing a similar activity together, (d) frequencies of nonthematic 
responses, i.e. those idiosyncracies of the subject’s behavior in the experimental sit- 
uation other than his phantasies, (e) frequency of all the above responses. 

The percentages were founded upon a comparison of the phantasies of thirty 
5-minute records from five subjects representing each racial group. These records 
were taken simultaneously by two examiners from the ten children. The agreement 
found varied from 55.5 per cent agreement to 93.2 per cent agreement for the Negro 
children and 75.0 per cent to 96.5 per cent agreement for the white children. The 
lower reliability values of the nonthematic responses may have been due to the in- 
frequency of such responses. Validity of the test is claimed on the basis of its clear 
differentiation of the subjects. Both in the Negro and in the white children, differ- 
ences between the responses among the racial groups to the majority of catagories 
were significant. 

The self consistency of the test was determined by comparing the nonstereo- 
typed phantasies projected by the child with the answers given by the child on an 
explicitly designed inquiry. According to Conn“ the play interview permits the 
child to express himself in such a manner that the doll is considered responsible for 
the expressions. Therefore, the self consistency of the test was based upon the fol- 
lowing conditions: (a) the types of doll play phantasies projected by the children 
whose inquiry responses were evident, and (b) the types of inquiry responses as re- 
flected by the children’s various types of doll play phantasies. To illustrate, ten of 
the thirty Negro children projected the mother doll as being most affectionate on 
their phantasies, while 55 per cent of the ten children stated that the mother doll 
loved the other dolls most often. The category of rejection was projected most fre- 
quently through the father doll by the Negro children. On the inquiry they stated 
that the father doll also hated the others most often. The children’s phantasies 
showed the most frequent agents of frustration to be the parent dolls. Responses to 
the question, ‘‘Which doll stops others from doing things most often?” also indicated 
the parent dolls, with the mother figure the principal agent. The category of ag- 
gression was expressed most frequently through the girl doll by the Negro subjects. 
However, on the inquiry they replied that the father doll would hurt others on more 
occasions. 

A similar analysis was made between the phantasies of the white children and 
their responses to the inquiry. Here again, doll for doll and family for family, the 
ratio was above chance except for the girl figure. The fact that the girl doll ap- 
peared in relatively few themes may have determined the low self consistency score. 
An interview was also prepared for the stereotyped responses, which resulted in al- 
most perfect agreement between the phantasies and the inquiries because of the 
great number and kind of responses. 


RESULTS 


The first major hypothesis was that quantitative differences exist between the 
phantasies of Negro primary school children because of their sex, development, and 
individuation. The results of intraracial sex differences among the Negro chidren 
are summarized in Table 1. The application of the t-test to the responses of the 
two groups indicated that 9 of the 16 comparisons made were significant, with five 
of the analyses significant at or below the five per cent level of confidence and four 
comparisons at the one per cent level. The most evident results between the Negro 
sexes were: (a) The girls manifested more stereotyped acts than the boys. (b) The 
doll play in five out of seven locations was significantly different between the sexes. 
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TaBLeE 1. INrTRARACIAL Sex DIFFERENCES WITH RESPECT TO STEREOTYPE AND NONSTEREOTYPE 
PHANTASIES 








Negro Children’s 10 Girls 10 Boys 
Phantasies 
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(c) The girls projected the dolls into the dining and living rooms a greater number of 
times. (d) No difference was found in total nonstereotyped acts. (e) Three out of 
seven nonstereotyped sulthemes were significantly different. (f) The girls mani- 
fested more affection. (g) \o difference was found in the amount of rejection between 
the sexes. (h) No differences were evident in frustration acts. (i) The boys produced 
a greater amount of ageression. (j) No significant differences were found in the voli- 
tional acts motivated by ends or means. (k) The girls produced more commands. 

A comparison was then made between nonstereotype projections of the first 
grade children and the nonstereotype phantasies of the second and third grade 
children. The tabular data were taken from trial two when the children were playing 
with dolls representing the race other than their own. The raw frequencies found 
show a trend toward more affection among the Negroes of the second grade toward 
the white dolls, while their rejection is conversely lower. On the other hand, grade 
three of the Negro children showed an atypical trend. However, when comparing 
affection to rejection, the former remained markedly higher. Although these findings 
were not statistically significant, their implications were evident. 
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The mean scores of the stereotyped responses produced by the brighter Negro 
children were compared with the productions of the less bright children and treated 
as an uncorrelated sample. Seven Negro subjects, three boys, and four girls made 
up one group with an IQ average above 115 on the Goodenough Scale. Their stereo- 
typed phantasies were subsequently compared with seven other Negro children, 
five boys and two girls whose average IQ was 88.7, and resulted in significant differ- 
ences at the two per cent level and within but twelve degrees of freedom. 

Insofar as individual differences are concerned, the range of stereotyped res- 
ponses for the Negro children was 78 to 291. Of the grand total of phantasies, 93.2 
per cent were of the stereotyped category. It was interesting to observe the differ- 
ences with respect to the location of the thematic actions. The room which the 
children used for most of their phantasies was the living room. The fewest phantasies 
involved the kitchen. These quantitative results may be indices of reality which 
likewise enhance the validity of the test. Concerning group action, the Negro child- 
ren showed an individual range from one group act per three trials, up to twenty- 
eight group actions. The average number of group acts was 13.8, with a standard 
deviation of 5.8 and a grand total of 416 group acts. 

The results of testing the second hypothesis, that quantitative differences exist 
between the phantasies of white primary school children because of their sex, de- 
velopment and individuation are summarized as follows. Intraracial comparisons 
were also made of the average differences between boys and girls concerning stereo- 
typed responses. Sixteen comparisons were made with 10 statistically significant. 
With regards to the stereotyped responses, four of the five significant differences were 
below the five per cent level of confidence. Concerning the nonstereotyped acts, 5 of 
8 comparisons were found to be significant, ranging from two per cent to the twenty- 
five per cent level. These results are also given in Table 1. 

The most evident findings between the white sexes were: (a) The girls projected 
more total stereotyped acts. (b) The doll play in four out of seven locations was 
significantly different between the sexes. (c) The girls projected the dolls into the 
children’s bedroom and living room a greater number of times. (d) The girls pro- 
jected more nonstereotyped acts than the boys. (e) Four out of seven nonstereo- 
typed subthemes were significantly different. (f) The girls manifested more affec- 
tion. (g) No difference was found in the amount of rejection between the sexes. 
(h) No differences were evidenced in frustration acts. (i) The boys produced a great- 
er amount of aggression. (j) Significant differences were found in the volitional acts 
motivated by means to a goal. (k) The girls gave more commands and orders. 

With respect to the nonstereotype projections of the first grade children as 
compared to the nonstereotype phantasies of the second and third grade children, 
the white children showed a similar ratio to the Negro children. The comparison was 
subsequently made between the intellectual ability and phantasy projections of the 
white children. These comparisons were found to be statistically significant at the 
one per cent level, in spite of the fact that they were also made within but 12 degrees 
of freedom. 

Concerning individual differences the white children produced a range of phanta- 
sies from 90 to 326. The relationship of stereotyped phantasies to nonstereotyped 
phantasies was likewise considerably different from child to child. It was also ob- 
served that the children projected a grand total of 91.8 per cent stereotyped phanta- 
sies. The fewest phantasies with respect to the white children involved the bathroom. 
Here again these responses may also be indices of reality. 

Concerning group action, the white children produced as few as eight group acts, 
and as many as twenty-five with a mean of 17.2, and a standard deviation of 5.6, and 
a grand total of 517 group acts. Analysis of the unclassified responses was not sig- 
nificant with the individual range of from one per cent up to a high of sixteen per 
cent. However, in the initial stages of recording, more unclassified responses were 
evident, but after studying the verbatim comments of the subjects, these phantasies 
were appropriately tabulated. 
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CONCLUSIONS 


It is the opinion of the author that the data substantiate both of the hypotheses 
which were formulated. The first and second hypotheses were supported in approx- 
imately equal strength. The results of the investigation suggest the following gen- 
eral conclusions: The family phantasies of children from different racial groups may 
be elicited by projective doll play in a parallel manner. The phantasies may also be 
quantified and analyzed objectively. The fact that various family phantasies could 
be recognized and measured suggests the possibility that projective doll play meth- 


ods may be an additional device for the investigation of child development and racial 
relations. 
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QUALIFICATIONS OF THE CLINICAL PSYCHOLOGIST FOR THE 
PRACTICE OF PSYCHOTHERAPY! 


ALBERT ELLIS, JULE NYDES AND BERNARD F. RIESS 
New York City 


Psychotherapy is the treatment of individuals with psychological problems. It 
is a process of emotional reeducation since it involves helping people to cope with 
problems which they developed or learned in the course of their interpersonal re- 
lations. In the process of psychotherapy, the emotionally disturbed individual, 
through a relationship with his therapist, is helped to understand himself and others, 
to work through and unlearn his immature, irrational ways of behaving, and to 
learn how to relate to others and to himself in a more adult, effective manner. (In 
child psychotherapy, of course, children are not helped to become adult but to be 
more effective and happier as children.) 

Because psychotherapy attempts to discover why the individual became dis- 
turbed in his relations with himself and others, and how he may be helped to become 
emotionally adjusted and to live happily and effectively, it is practiced, to some ex- 
tent, by all professional and lay people who importantly influence the individual’s 
human relations—including teachers, clergymen, personnel counselors, jurists, pro- 
bation officers, and parents. Those who are specifically trained for psychotherapy 
and who practice it as a profession are becoming more numerous every day, and at 


This statement was prepared at the request of the Joint Council of New York State Psychologis 
on Legislation. Helpful suggestions were made by Michael Amrine, J. Mc V. Hunt, Fillmore H. vs 


ford, and L. Joseph Stone, who saw a preliminary draft of the statement. The authors assume all 
responsibility for the views expressed. 
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present include clinical psychologists, psychiatrists, psychiatric social workers, mar- 
riage counselors, and members of several other professional groups. 

Because so many individuals in our society have psychological problems, be- 
cause these problems stem directly from their relations with others, and because the 
field of human relations includes many different kinds of professional workers, psy- 
chologists (as indicated in the official position of the American Psychological Asso- 
ciation) believe that the professional practice of psychotherapy should not be re- 
stricted to individuals trained in any single discipline, but should be the responsibil- 
ity of competent therapists of various professional backgrounds. 

Among those who are specifically trained for and who engage in the practice of 
psychotherapy, the clinical psychologist has a background that well qualifies him 
to be a psychotherapist. Not all clinical psychologists, to be sure, are interested in 
the practice of psychotherapy, since many specialize in psychodiagnostic testing. 
There are also other kinds of psychologists, such as experimental, physiological, and 
social psychologists, who may engage in areas other than psychodiagnostic testing 
or psychotherapy. No justifiable contention can be upheld, therefore, for the right 
of all psychologists to practice psychotherapy, any more than it can be held that all 
physicians should do major surgery or all lawyers should be tax consultants. 

There are a number of reasons why clinical psychologists who are trained for 
the practice of psychotherapy are well prepared to engage in this practice: 


1. Psychotherapy is and must necessarily be rooted firmly in the science of 
human behavior. Unless it is clearly understood what human beings are, how they 
get to be the way they are, and how they can be helped to change their immature and 
ineffective ways of behaving, little effectual psychotherapy can be accomplished. 
But psychology (in association with sociology and anthropology) is the basic science 
of human behavior, and most of what we now know of infant, child, adolescent, and 
adult individual behavior, as well as much of what we know about group or social 
behavior, has been discovered by psychologists. Clinical psychologists, in particu- 
lar, are excellently equipped to treat individuals with behavior disorders—since 
human behavior and its disorders is the one main area in which they are scientifically 
trained, in which they do fundamental research, and to which they specifically apply 
their time and talents. 


2. Psychologists (together with sociologists, educators, anthropologists, phys- 
iologists, engineers, and other scientists) are students of the processes of learning and 
communication. Emotional and mental disturbances are basically habits learned by 
people in the course of their attempting unsuccessfully to communicate with other 
individuals as they are growing up. The disturbed person is one with habits of lang- 
uage, of beliefs, of attitudes, of emotional responses, etc., which make it difficult for 
him to communicate effectually with other human beings or even with himself. 
Psychotherapy, at bottom, is effective communication between the therapist and 
the disturbed person in the course of which the latter learns new habits of language, 
belief, attitude, and emotional response which enable him to communicate more 
adequately with himself and others. In short, he learns to replace his poor habits 
with others which are more effective for living. Psychotherapy is an interpersonal 
relationship between therapist and client through which the latter learns how to 
achieve mature relations with others. As a basic investigator, student, and applier 
of the science concerned with human learning and human communication and inter- 
personal relations, the clinical psychologist is especially well equipped to practice 
psychotherapy with emotionally disturbed individuals. 

Otherwise stated: emotional and mental disturbances result from an individ- 
ual’s infantile, irrational, confused attitudes, beliefs, prejudices, ideas, or concepts. 
Normal and disordered attitudes, beliefs, prejudices, ideas, and concepts are one of 
the main areas of research and experimentation of the psychologist, especially the 
social and clinical psychologists. The clinical psychologist, therefore, has a fine back- 
ground for discovering how to help a disturbed person reorient his ideas and atti- 


tudes so that he no longer need think, emote, and behave in an ineffective, self-de- 
feating manner. 
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3. The science of psychology includes the study of physiological psychology 
or neuroanatomy—that is, the study of the functioning of the nervous system of the 
human body. Some of the basic research in mental deficiency, epilepsy, cerebral 
palsy, multiple sclerosis, organic psychoses, psychosomatic disorders, and other be- 
havior deviations with neurological involvement has been and is now being done by 
clinical, experimental, and physiological psychologists. Knowledge of physiological 
psychology enriches the background of a psychotherapist who treats the psychological 
problems of persons suffering from nervous disorders. Physical diseases of the ner- 
vous system remain, of course, the. province and responsibility of medically trained 
specialists. 

The physical symptoms of epilepsy, for example, should be medically treated 
with appropriate drugs and other procedures, but the personal or psychological 
problems which many individuals experience because they are epileptic, as well as 
the psychological disturbances which may contribute to the genesis or the aggrava- 
tion of this disorder, are treated psychotherapeutically. In other words, psycholo- 
gists treat emotional disturbances which may give rise to or follow from physical 
symptoms; they do not treat the physical symptoms themselves. 

Most medical as well as non-medical psychotherapists tend to agree that it is 
contrary to good practice for the psychotherapist to undertake any physical exam- 
ination of his clients. Physicians and psychologists who practice psychotherapy 
refer their clients to medical specialists for the diagnosis and treatment of physical 
complaints. This does not mean that an individual’s physical and psychological 
problems are entirely separate or unrelated; on the contrary, they are quite inter- 
related, since, as modern research increasingly shows, psyche and soma, or mind and 
body, are unity. But the human organism is so complex that there must usually be 
some specialization and division of labor by those who help to treat its malfunctions. 
Thus, there are specially trained and qualified oculists, otolaryngologists, and allerg- 
ists among medical men. Thus again, there are specially trained and qualified clinical 
psychologists (and psychiatrists and psychiatric social workers) who specialize in 
psychotherapy, or the treatment of ‘‘psychological’”’ problems while taking cog- 
nizance of, but not attempting to treat, ‘“‘physical’’ ailments. 

It is interesting to note in this connection that Freud (Gesammelte Werke, Vol. 
XI, p. 408) remarked that “the medulla oblongata is a very serious and beautiful 
thing. I remember very well how much time and trouble I devoted to the study of it 
years ago. Today, however, I must say I do not know of anything that seems to be 
more irrelevant for the psychological understanding of anxiety than a knowledge 
of the nervous paths its excitations follow.’’ Since Freud made this remark consider- 
able research has been done in the physiology and anatomy of the nervous system, as 
a result of which we may be on the verge of making neurological knowledge relevant 
to the understanding of the mechanisms of anxiety. Much of this research on the 
nervous system has been and is being done by psychologists, and to the extent that 
it becomes important in psychotherapy, clinical psychologists are well equipped to 
make the most appropriate use of it. 


4. Psychologists engage in fundamental researches into the behavior of the 
most seriously emotionally and mentally disturbed individuals, including delinquent, 
psychopathic, brain injured, and psychotic persons. Clinical psychologists normally 
serve as internes and work under psychological, medical, and psychiatric supervision 
in mental hospitals, mental hygiene clinics, correctional] institutions, homes for the 
mentally defective, and other agencies where some of the most disturbed individuals 
are commonly seen. Clinical psychologists thereby acquire training and experience 
in understanding and treating these kinds of disturbed persons, although they are of 
course not qualified to use any form of physical medicine, such as drugs, shock 
treatment, or surgery. 


5. Human emotional and mental difficulties, particularly those of children, 
are often linked to specific educational difficulties. Disturbed individuals, for ex- 
ample, frequently cannot read, spell, or do arithmetic as well as they are potentially 
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able to do; and, by the same token, persons who cannot read, spell, or do arithmetic 
well frequently become emotionally upset. Also, individuals with physical handi- 
caps, such as sight, hearing, or motor defects, may become emotionally disturbed; 
and disturbed persons may acquire sensory and motor defects such as stuttering. 
Psychologists do fundamental research into problems of diagnosing and treating 
individuals with educational, sensory, and motor disabilities; and clinical psycholo- 
gists are specifically trained to understand these individuals and to help them through 
psychotherapy. 


6. Many or most persons with severe psychological problems also become in- 
volved in vocational difficulties, and are unable to select or to pursue suitable voca- 
tional goals. Clinical psychologists receive specific training in vocational aptitudes, 
interests, and problems, and are consequently especially well equipped to help 
psychotherapy clients with their vocational difficulties. 


7. Almost all clinicians agree that the most effective treatment of emotionally 
and mentally disturbed persons usually cannot be accomplished unless the psycho- 
therapist understands the full extent and manner of disturbance involved—unless, 
in other words, he makes an accurate diagnosis of what is troubling the client. Clin- 
ical psychologists are specifically trained and experienced in making emotional and 
mental diagnoses, through the use of psychological tests, observation, and inter- 
viewing, and are frequently consulted for this purpose when psychiatrists, social 
workers, and others who offer help with psychological problems are in doubt about 
making a definitive diagnosis. Clinical psychologists who practice psychotherapy 
are therefore particularly well equipped to understand the problems of their clients. 


8. The psychologist who practices psychotherapy today undergoes training 
which usually includes the following process: (a) He majors in psychology, or the 
science of human behavior, in college, and at the same time often minors in the other 
social sciences, especially sociology, education, and anthropology. (b) He takes a 
four-year graduate program in clinical psychology after he receives his college degree. 
In the course of his undergraduate and graduate training, he receives a thorough 
grounding in physiological, experimental, social, educational, and general psycho- 
logy. (c) He receives special training in all important aspects of clinical psychology, 
including mental testing, personality testing (with projective and non-projective 
techniques), remedial diagnosis, neuroanatomy, psychodiagnosis, psychopathology, 
clinical research, and psychotherapy. (d) During uis graduate work, he receives 
supervised training in psychological testing and psychotherapy. (e) He conducts a 
clinical research study, on the basis of which he writes a doctoral dissertation. (f) 
He obtains a minimum of a year’s internship in a mental hospital, mental hygiene 
clinic, or other psychiatric or psychological institution or agency. (g) He works for 
several years, under clinical supervision, in a hospital or clinical installation. (h) He 
undertakes personal psychotherapy and supervised psychotherapy for several years 
on a postgraduate level. (i) He frequently does postgraduate work at a university, 
medical center, psychoanalytic institute, or other facility. 

Finally, as a result of this kind of extensive and intensive psychological and 
psychotherapeutic training, the clinical psychologist is qualified to engage in the 
independent practice of psychotherapy. Not all psychologists now practicing psy- 
chotherapy, it must be admitted, fully measure up to the foregoing standards of 
training and experience. This kind of training, however, is the goal of modern clini- 
cal psychology programs and is being increasingly attained by clinical psychologists. 


9. Psychoanalysis is one of the basic areas of the science of psychology. It 
consists of a comprehensive theory of the psychodynamics of human behavior and a 
specialized method of intensive psychotherapy. From the very beginning of psycho- 
analysis, outstanding psychologists participated with Sigmund Freud in developing 
the ficld of psychoanalysis and in pioneering with techniques of psychoanalytic 
treatment. Some of the outstanding psychoanalytic theorists and practitioners, 
such as Otto Rank, Theodor Reik, and Erich Fromm, have been psychologists. 
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According to Freud (Collected Papers, Vol. V, p. 207), “psychoanalysis falls 
under the head of psychology, not medical psychology or psychology of the morbid 
processes, but simply of psychology.” Clinical psychologists today who bocome 
psychoanalysts undergo an intensive personal anal:sis, carry cases under the super- 
vision of training analysts, and participate in a program of courses and seminars in 
psychoanalytic institutes. In the course of such psychoanalytic training, they 
specifically learn to understand the influence of childhood experience on later human 
behavior, the nature of the individual’s basic character structure and system of per- 
sonality defenses, the importance of unconscious motivation on human behavior 
and symptom formation, and the role of transference in interpersonal relations and 
in the psychotherapeutic relationship. Because of their basic psychological and clin- 
ical training and experience, and their special researches into and understanding of 
individual and social interpersonal processes, ‘ical psychologists who become 
psychoanalysts are unusually well qualified to practice psychotherapy. 


SEXUAL DIFFERENTIATION ON THE DRAW-A-PERSON TEST! 


CLIFFORD H. SWENSEN? 
Veterans Administration Mental Hygiene Clinic, K: ile, Tenn 





PROBLEM 


The differentiation, or lack of it, between the male and female figures drawn by 
patients in performing the Draw-A-Person Test (DAP) is suggested as indicating 
the degree to which the subject has adequately identified himself sexually“ ®. A 
drawing in which the two sexes are clearly and correctly differentiated would pre- 
sumably indicate an individual whose sexual identification was mature and unim- 
paired, while a drawing in which there was little or no differentiation between the 
sexes would indicate an individual whose sexual identification was impaired. This 
impairment might be a reflection of general deficit, such as in the case of a psychosis, 
or it might reflect impairment that was confined chiefly to the sexual area, such as 
might be found in a case of homosexuality. 

In order to investigate more precisely the relationship obtaining between sexual 
differentiation on the DAP and various behavioral characteristics, it was felt de- 
sirable to devise a scale for rating sexual differentiation. 


METHODOLOGY 


All of the drawings in the files (101 drawings) of the Knoxville, Tennessee Vet- 
erans Administration Mental Hygiene Clinic were used. Seven judges were first 
asked to sort the drawings into five piles of equal size (about 20 drawings per pile), 
on the basis of sexual differentiation. The drawings exhibiting the poorest sexual 
differentiation would be at the one extreme, with the drawings exhibiting the best 
differentiation at the other extreme, and the others ranging between the two ex- 
tremes. The twelve drawings on which there was the greatest agreement that they 
were the poorest were then selected. The judges were asked to rank these 12 draw- 
ings for sexual differentiation. The three with the lowest average rank were chosen 
to represent the ‘‘bottom”’ of the scale. This same procedure was used to select the 
three best drawings to represent the “‘top”’ of the scale, that is, the end of the scale 
in which the sexual differentiation was the best. 


1Reviewed in the Veterans Administration and published with the approval of the Chief Medical 
Director. The statements and conclusions do not necessarily reflect the opinion or policy of the Vet- 
erans Administration. The author wishes to thank the following for suggestions, providing samples of 
drawings, and serving as judges: Drs. G. R. Pascal, C. K. Harbin, E. O. Milton, kK R. Newton, H. 8. 
Bacon, C. R. Sipprelle, J. M. McKee, A. 8. Webster, Mrs. Georgianna Jennings, and Messrs. Jack 
Byrne and Joseph Gilbert. 

*Now at University of Tennessee. 
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After the two extremes of the scale had been determined, the judges were given 
the remaining 60 drawings and asked to sort them into three equal piles. One pile 
was to be made up of drawings falling midway between the “best’’ and the “poorest’’ 
ends of the scale. The second pile was to be made up of the drawings falling toward 
the poorest end of the scale and the third pile was made up of drawings falling toward 
the best end of the scale. When this sorting was completed, the judges were given the 
12 drawings on which there was the greatest agreement as to falling midway be- 
tween the extremes of the scale and asked to select the three drawings that came 
closest to falling midway between the two extremes of the scale. The three drawings 
on which there was the greatest agreement were then selected to represent the mid- 
point of the scale. This same procedure was used to select the three drawings that 
fell exactly between the poor end of the scale and the mid-point, and to select the 
drawings that fell exactly halfway between the mid-point and the best end of the 
scale. 

When the seven judges had completed their work, there was a scale consisting 
of five points, with three drawings serving as examples of each point, giving a total 
of 15 drawings in the scale. The judges were then given the drawings and asked to 
write down the characteristics of the drawings, on each point on the scale, that 
seemed to differentiate between the male and female. The judges’ descriptions 
were condensed and used to describe each point on the scale. The scale was then 
expanded, with five points with descriptions and examples (points 1, 3, 5, 7, and 9) 
and four points (points 2, 4, 6, and 8) to be used for rating drawings that appeared to 
fall midway between the defined points. 


THE ScALE® 


The five described points, and one drawing used to illustrate each point are 
presented below: 


Point 1. Little or no sexual differentiation. 


There is little or no difference between the two figures, and what difference 
exists between them does not particularly suggest sexual differentiation. 


Point 8. Poor sexual differentiation. 


Longer hair on the female than on the male. There may be a slight sug- 
gestion of difference in body contour and /or clothing. 


Point 6. Fair sexual differentiation. 


The female definitely has longer hair than the male. The female clearly 
has different body contour, with either rounded hips or breasts, or both present. 
There may be a suggestion of difference in the clothing of the pair. 


Point 7. Good sexual differentiation. 


The female has longer hair than the male. The female has rounded body 
contour, the male has angular body contour. Breasts and/or rounded hip: 
present, with both usually present. There is a clear difference in clothing, with 
the female wearing feminine apparel, although the apparel may be copied after 
that of the male, e.g. slacks. There may be the suggestion of differentiation in 
minor details, such as eyelashes or fuller lips on the female. 


Point 9. Excellent sexual differentiation. 


Female hair is longer than male hair, with definite feminine hair styling 
in the female. The male body has angular contour, the female body has rounded 
contour with both rounded hips and breasts present. The male wearing clothing 
that is definitely masculine, the female wearing clothing that is clearly feminine. 
Minor details, such as eyes, mouth, earrings, bracelets, etc., clearly appropriate 
for the sex of the figure on which they are drawn. 


8A multilithed copy of the entire scale may be obtained by writing to the author. 
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Figure 2 
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In order to get an estimate of the scale’s reliability, two independent judges were 
asked to use the scale to rate 58 sets of drawings randomly selected from the files of 
the Knoxville VA Mental Hygiene Clinic. Neither of the judges had seen any of the 
drawings or the rating scale previous to this. Their ratings correlated .84 with each 
other. This appears to indicate fairly good reliability for the scale. 


Figure 1. Pornt 1 ON THE SCALE. 
Ficure 2. Pornt 3 ON THE SCALE. 
Ficure 3. Pornt 5 ON THE SCALE. 
Ficure 4. Point 7 ON THE SCALE. 
Figure 5. Pornt 9 ON THE SCALB 


Discussion 

One might ask whether or not good or poor sexual differentiation on the draw- 
ings might not merely reflect general deficit, or lack of it in the area of visual-motor 
functioning. If this were true, one might expect differentiation to correlate rather 
highly with some other similar motor task, such as performance on the Bender- 
Gestalt. Thirty-eight cases randomly selected from the files of the Mental Hygiene 
Clinic were used to determine the relationship between Bender performance, and 
differentiation on the DAP. Using the rating scale for rating the drawings, and 
Pascall’s scoring system“) for scoring the Bender, a correlation coefficient of .24 
was obtained. This is not significant at the .05 level, and would indicate that sexual 
differentiation on the DAP is not a measure of the same variables as are measured 
by the patient’s general performance on the Bender. 

In an effort to obtain a rough idea of the sort of distribution that might be ob- 
tained from hospitalized patients, and from patients in an out-patient clinic, a dis- 
tribution was made of the ratings of the 58 patients used in the original reliability 
study of the scale, and a distribution was made of 19 patients recently tested on ad- 
mittance to the Eastern State Hospital, Knoxville, Tennessee. This comparison is 
presented in Table 1. It will be noted that the out-patients tend to be distributed 


TABLE 1. CoMPARISON BETWEEN [N-PATIENTS AND OvT-PATIENTS FOR 
SexvaL DIFFERENTIATION ON THE Draw-a-PERSON TEST. 








Rating In-Patients Out-Patients 





Poor (1-3) 14 22 
Good (4-6) + 24 
Excellent (7-9) 


1 12 
Totals 19 58 





Chi-square 7.15 significant at .05 level 





rather evenly over the scale, whereas the in-patients tend to concentrate at the low 
end of the scale. Using chi-square to determine the significance of the difference, it 
was found that the difference was significant at the .05 level. It might be suggested 
that this was due, in part, to the fact the scale was originally derived from drawings 
by out-patients, and that one would expect to get a normal distribution from out- 
patients, a clustering at the low-end from in-patients, and a clustering at the high 
end from normals. At any rate, this rather inadequate step toward developing 
norms for sexual differentiation on the DAP provides tentative evidence indicating, 
as might have been expected, that hospitalized patients demonstrate considerably 
more deficit in this area than out-patients. 

The rather inadequate data presented here give a very sparse hint of the poss- 
ible significance of sexual differentiation on the DAP, and point up the need for more 
adequate and more extensive norms obtained from large numbers of normals, and 
in-patients and out-patients of varying diagnoses, and more extensive studies to 
determine the significance of sexual differentiation. And, although data presented 
here give some idea of the reliability of the rating scale, they do not give any idea 
of the reliability of the drawings themselves. This should be studied. 
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SUMMARY 


A scale for rating sexual differentiation on the Draw-A-Person Test was con- 
structed. Reliability study showed the scale to have a reliability of .84. Sexual 
differentiation, as measured by the scale does not appear to correlate significantly 
with another, similar, visual-motor test, the Bender-Gestalt. Hospitalized mental 
patients have significantly poorer sexual differentiation on the DAP than patients 
being treated in an out-patient clinic. 
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INVESTIGATIONS INTO SELF-CONCEPT: 
II. STABILITY OF REPORTED SELF-IDENTIFICATIONS 


JAMES F. T. BUGENTAL! AND EVELYN C. GUNNING 
University of California, Los Angeles 


PROBLEM 


Within the area of personality theory and investigation, the Self-Concept has 
proven a particularly helpful construct for organizing discussion “: ® 7, % 1, 11, 13, 14, 
and others)’ While some investigations®: ™ °e) have been reported of the manner 
in which the self-concept appears to be reflected in the therapeutic situation, relative- 
ly little has been demonstrated about the nature of self-perceptions (and thus, by 
implication, of the self-concept) in less selected subjects. The present series of 
studies: 4 8 '5) attempts to develop some exploratory leads toward the lines along 
which generalized description of self-perception may be made. 


PROCEDURE 


Administration. The W-A-Y or “Who Are You?” technique is administered in 
the following manner, whether to groups or individuals: 

The subject is given a piece of paper and is told: “I am going to ask you a ques- 
tion and I want you to write three answers to the question on this paper. Your 
answers may be anything you wish: words, phrases, sentences, or anything at all, 
so long as you feel satisfied that you have answered the question three times. Re- 
member you are to give three answers. The question is: ‘Who are you?’ ”’ Any ques- 
tions asked by the subject are answered by repeating the instructions in whole or part. 

One of the first concerns in the use of the responses thus secured was to find 
dimensions along which to conduct analyses. In the previous study ®, based on 


Now with the Psychological Service Associates, 1355 Westwood Boulevard, Los Angeles 24, 
California. 
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group administered responses, a system of categories was established by a prelimin- 
ary review of several hundred responses. Each of two workers went through these 
returns, preparing lists of possible categories. These lists were then exchanged and 
tried out by the other worker. Finally both worked together criticizing and revising 
the combined lists. The adequacy of the system of categories which survived this 
selection process was tested, and it was found to be relatively stable for groups hav- 
ing only moderate differences, even when comparisons were made between sexes and 
among age levels. When replies of strongly contrasting populations were examined 
the system of dimensions was found to be responsive to these greater differences. 
The present article provides information based on further use of the W-A-Y 
but with the following differences from the earlier paper: (1) all responses were 
secured in individual interviews rather than from group administrations, (2) the 
overworked experimental animal, the college student, was not used as subject, and 


(3) additional information was secured on the respondents making possible further 
analyses.* 


Collection and tabulation of data. The responses used in the present report were 
secured by senior and graduate students enrolled in a course in interviewing. The 
students were not permitted to use persons already known to them. Each student 
submitted an extensive report on his interviewing, and these were carefully read 
before the responses he obtained were selected for use in the study. Two separate 
samples, obtained about three months apart, were collected in this manner. In the 
first sample responses from over 250 persons were available, of which 100 were 
selected as being secured under optimal conditions, and having all required informa- 
tion reported. In the second sample (collected by more carefully instructed inter- 
viewers), responses from approximately 90 persons were available, out of which 60 
were chosen. 

Each response was tallied in a content category similar to those used with the 
group responses, with some refinements necessary for statistical precision. A minor 
addition to the administrative procedure called for the subject to indicate that word 
which he felt to be essential to each of his answers. The content category to which 
each response was assigned was thus determined by giving primary attention to this 
key word. The twelve categories employed are described below. 

— Name: Either the whole name, a nickname or initials (e.g., “Mary Jones”, “Ed’’, or “T. 

‘amily: Conjugal family (e.g., ‘“Wife of Corky’’) or consanguine family (e.g., “Son of my 

arents”’ 

, oc, Positions of status or identifying with a social grouping (e.g., “President of Rotary” 
or ‘Mason 

Oomapation: Aspirations for occupation also included (e.g., “Bricklayer” or “Going to be an 
accountant’). 

Age: Named or adjectival ages (e.g., “Thirty” or “Old’’). 

Sex: (e.g., “Girl”, “Woman”, or “Male’ 43 

Group Membership: Either a "generalized term (e.g., “Resident of California’’) or one of a 
group (e.g., “Democrat” or “White man’’), without distinguishing the individual. 

Unit: Points out uniqueness (e.g., “Individual mind among millions of minds’). 

Positive Affect: Any positively toned response (e.g., “Trying to do good”’ or “Creative’’). 
¥ be oe ot Ambivalent Affect: Negatively and ambivalently toned responses (e.g., “Ugly” 
or 1xed-up ). 
Other: Teihetes all responses that do not fit into categories listed above (e.g., “Sports fan’’). 
Double: This is a formal, not a content, category. Any response which duplicates (in cate- 
gory) a previous response of the same person is tallied as a “double’’. 


The percentage frequencies by category for each sample are shown in Table 1, 
along with the total percentage frequencies when the two samples were combined. 
Whole numbers were used in expressing these data since they were felt not to war- 
rant extensively refined statistics. 


*The authors would like to stress that the W-A-Y is still an experimental method and is in no 
sense being offered as a new projective technique or psychometric. 
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JAMES F. T. BUGENTAL AND EVELYN C. GUNNING 


FINDINGS 


Reliability. The two separately drawn samples were contrasted in order to in- 
vestigate the reliability of the frequencies (in terms of proportions) with which the 
categories are employed. Critical ratios were obtained for each of the twelve com- 
parisons between the two samples by means of the Edgerton and Peterson Tables. 
These ratios indicated a high order of stability. Not even the 5% level of confidence 
was reached in any of the comparisons. It would appear that adequate reliability is 
demonstrated in the categorizing system employed. 


Category adequacy. Not only do the category frequencies appear to be quite 
reliable, but each, in itself, shows significantly frequent use to warrant its inclusion 
in the analysis plan. All frequencies of the total sample when treated as proportions 
were well over three times their standard errors, except for the Age category. This, 
with a critical ratio of 2.9 is significant at the 1% level.’ 


Category usage. Attention may next be directed to the relative frequencies with 
which the several categories are employed by the respondents. The greatest per- 
centage frequencies consistently obtained were in the Name (56%) and Occupation 
(55%) categories. The Group Membership category was mentioned next most fre- 
quently (483%). The least frequently employed of the categories were Age (6%) and 
Negative Affect (7%). When frequencies for positive, negative, and ambivalent 
affective responses are combined, the resulting Total Affect shows a pattern similar 
to that found when the categories are treated separately. This is also true for all 
the comparisons below. 


INFLUENCES OF SOCIOLOGICAL DIMENSIONS 


While category frequencies remain generally constant for discrete random 
samples, it will be interesting to observe the influence of contrasting these frequencies 
when various subgroups within a sample are compared. Such contrasts may be made 
in terms of characteristics of the respondents along what might be termed “‘sociolo- 
gical dimensions,’ such as age, sex, marital status, education and occupation. The 
results of such inquiries are reported in the following paragraphs and in Table 1 for 
the sample of 100 persons. 


Age. The first point to be investigated is whether the content of the self-percep- 
tion is a function of the age of the perceiver. The sample was broken into three 
groups roughly thought to correspond to youth, early middle age, and maturity: 43 
persons ranged from 18 to 28; 33 persons from 29 to 40; and 24 respondents ranged 
upward from 41 through the sixties. The method used is that of noting the critical 
ratios between the standard errors for the proportions in each age group. It is found 
that, of the 36 comparisons made, none attains the three sigma level, and only two 
critical ratios are even significant at the 1% level. In the Family category it was 
found that the members of the middle aged group state their family relationship 
more frequently than those of the youngest group, and the youngest group members 
tend to state their Sex more frequently than the middle group. Two other critical 
ratios are significant at the 5% level: the Family category and the Name category 
are used more frequently by the middle age group than by the older group. 


Sex. The sex of the respondent seems to have somewhat more significance in 
determining his reported self-perceptions than does his age. This was not found to 
be true of the college students in the earlier report®. Two of the twelve critical 
ratios comparing the sexes are significant at the 1% level: females give their Family 


*This step, which was not adequately described in the earlier publication ®), is designed to provide 
some objective basis for determining a lower “cut-off point” of frequency for category utility. Thus, it 


was decided that any category must show a frequency at least 1.96 times its standard error to warrant 
its continued use. 
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status more, and males their Growp Membership. One ratio is significant at the 
5% level, the women tell their Ages more frequently than do the men. 


Marital status. Since the family category has shown the most consistent variation 
thus far, it might be expected that a comparison of the married and unmarried 
respondents would show marked differences throughout the categories. This does 
not occur, however, and the impression of essential stability of the classification sys- 
tem seems to be reinforced. For this comparison, ten persons from broken homes 
were dropped from the sample; 31 of those remaining were unmarried, and 59 were 
married. Two of the twelve critical ratios attain the 1° level and one the 5%. These 
respectively indicate: the married give their Family status appreciably more fre- 
quently (critical ratio 4.9) as they do also their Social positions. The unmarried, on 
the other hand, see themselves more frequently in terms of Group Memberships. 


Education. Interestingly, educational level seems to have practically no in- 
fluence upon the reported self-perceptions. Forty-three subjects attended high school 
or trade school; while 48 had at least some college experience. (Nine subjects with 
no education beyond grade school were dropped from this comparison). No critical 
ratio reaches the 1% level and only one the 5%. That one shows the college group 
giving their Ages with greater frequency. 


Occupation. Finally we may turn to the differences found between occupational 
groups. Three groups are contrasted here. The first, having 42 cases, consists of 
professional and white collar employees; the second is made up of 27 industrial and 
laboring persons; and the last consists of 31 persons without regular “outside’’ em- 
ployment (housewives, retired, unemployed, and four students who somehow crept 
in). The professional and white collar group gives Age in greater frequency than does 
the industrial (significant at the 1% level). Those not regularly employed give 
Family status more than the professional; however, this difference (significant at 
the 5% level) is probably a product of the inclusion of the housewives in the former 
group. These same group members (those not regularly employed) give more 
Positively toned responses and more Duplications (both significant at the 5°% level) 
than the professional and white collar workers. It is interesting to note that no differ- 
ence between the industrial and laboring group, and those not regularly employed 
attains even the 10°% level of confidence. 


SUMMARY 


The findings have been reported in terms of the relative stability of category 
frequencies. This orientation arises from the interest in locating the more generally 
consistent aspects of self-perception. By way of summarizing these inquiries, it may 
be noted that of the 108 comparisons made, only 7 (approximately 6%) attain the 
1% level of significance and an additional 8 (approximately 7°) attain the 5% level 
of significance. In other words, it seems that the categories do have consistency. In 
terms of the influences of the sociological dimensions, the respondent’s sex and 
marital status seem most important. Each shows 25°% of the categories varying 
significantly. Age and occupation prove only about half as influential (11% each), 
while education is apparently least (8°). 

From the point of view of the individual categories, Family is the most subject 
to variation, with five out of the nine comparisons in which it was concerned yield- 
ing significant differences. Age is next (with three), then Growp Membership (two). 
Five other categories show but one significant comparison each (Name, Sex, Social, 
Positive Affect and Duplications). Four categories yield no significant differences in 
any of the comparisons (Occupation, Unit, Negative and Ambivalent A ffect, and Other). 

It can thus be seen that when analyzed the body of responses to the W-A-Y 
has certain stable characteristics in terms of being describable in a limited number of 
content categories. This description remains relatively stable even when the total 
body of responses has been broken into subdivisions and contrasted in terms of 
certain sociological dimensions. 
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AN INVESTIGATION OF JUVENILE PROFILES ON THE SZONDI TEST* 


EDWARD M. SCOTT 
a State Hospital 


‘endleton, Oregon 


INTRODUCTION 


The Szondi Test was originally conceptualized to support the theory that mental 
disorders have a genetic origin in latent recessive genes. These same genes presum- 
ably influence an individual’s spontaneous choice reactions. Deri® formulated a 
different hypothesis that personality consists “of a number of need-systems (or 
drives)’’ with the Szondi test reflecting the distribution of these drives and how they 
are handled by an individual. Deri’s contribution is readily accepted by Szondi® 
for he states in the Forward to Deri’s book that the ‘dynamic thought processes” 
and “drive factors’ have augmented the value of his test. 

Various workers have subjected Deri’s hypothesis to experimental investiga- 
tion. Fosberg®, Lubin and Malloy“®), Mussens and Krause“), Goldman ®?, 
David and Rabinowitz“, Borstelman and Klopfer®, and Guertin®? have, on the 
whole, failed to confirm Deri’s assumptions. Whereas, Harrower“ and Bromberg 
and Cyerfa®) seem to substantiate Deri’s assertions. 

The present study was undertaken on a juvenile population to investigate the 
following hypotheses: (1) Are Deri’s assumptions regarding delinquency verified? 
(2) Is there a significant difference in personality between delinquents and non- 
delinquents as reflected by the Szondi? (3) What adolescent sex differences appear 
on the Szondi? (4) Does the Szondi reflect religious influence? 


PROCEDURE 


The test was individually administered, with good testing conditions, to 1200 
adolescents between the ages of thirteen and nineteen. The population included 600 
delinquents equally divided into groups of boys and girls. A delinquent, as defined 
in this study, consists of a juvenile between the ages of thirteen and nineteen who has 
come before a court for some misdemeanor and was adjudicated. The non-delin- 


*The original study, of which this article is a brief resume, was submitted to the Psychology De- 
partment, University of Portland, in partial fulfillment for the degree of Doctor of Philosophy. 
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quent population also comprised of 600 adolescents equally divided into 300 boys 
and 300 girls. This was further divided into 150 boys and 150 girls who attended 
Catholic high schools and 150 boys and 150 girls who attended public high schools. 
All subjects were from the states of Oregon and Washington. The mean ages of the 
subjects were sufficiently close enough to indicate a homogeneous population. 
After the scores were tabulated according to the usual method (+, -, A, O) 
of scoring, each major group was tested by the Chi-Square technique, following the 
procedure of Cohen®). Next, as described by Edwards“ the Chi-Square method 
“applied to more than two groups” was utilized to determine greater than chance 
variation among the various sub-groups for the eight factors. Lastly, the following 
formula from Guilford “* was employed: 
t= P, — P, where , 


oP, — P; P, — Pp =/ Pi: Qi + P2Q 
Ni Ne 








P, — Ps 


P; Qi + Pe Q: 
Ni N2 








This formula was applied to the various factors for those sub-groups in the significant, 
or nearly significant, Chi-Squares. The nearly significant Chi-Squares were found 
occasionally to have more than chance differences between certain inner-fluctuations. 


RESULTS 


When the population sections were compared with chance occurrence by the 
Chi-Square method, factors H, 8, Hy and K always appeared significant. Factors 
P, D and M failed to reach significance only once, while the E factor failed to differ 
from chance five times. The table for Chi-Squares of compared sections of the popu- 
lation according to Szondi factors is not here recorded. In summary, some significant 
results were obtained between every matched group; and all the factors, except E, 
appeared significant in some of the compared groups. Sex differences appear to ac- 
count for the greatest number of differences. 

Table 1 presents the t-ratios between the match-sections of the population to 
determine which inner-fluctuations of the factor were significant. Three points de- 
serve mention: (1) Although some of the Chi-Squares did not quite reach statistical 
significance, some f-ratios were significant. To suggest interpretations as to the 
meaning of these particular t-ratios would be highly speculative. (2) Not all of the 
Chi-Squares were computed because some obviously would not result in significant 
t-ratios. (3) Some of the significant ‘‘t’s” at the 5% level may appear by chance. 


Discussion 


The conclusions of this study are mainly derived from Table 1. Following is a 
summary of our findings related to Deri’s basic assumptions with which a close ac- 
quaintance is presupposed. 


The H Factor. 


+H. Deri’s assertion that +H implies the need to be the recipient of love and hence is 
“more characteristic for women than for men” lacks confirmation. The total boy population 
when compared with the total girl population, and non-delinquent boys when matched with 
non-delinquent girls, gave a significantly higher +H record. 

-H. No significant trends. 


Open H. Deri’s hypothesis received possible support since delinquent girls and non-de- 
linquent girls when compared with delinquent boys and non-delinquent boys, respectively, 
scored significantly higher. Perhaps girls can sublimate better than the boys or boys have 
“stronger” sex impulses. 

Ambivalent H. The total boy population when matched with the total girl population scored 
high, hence Deri’s prediction received partial confirmation. 
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The S Factor. 
+S. In general Deri’s assertions hold true, since girls never score higher than boys; and 
delinquent girls when compared with non-delinquent girls reveal significantly higher scores. 
hich Once again, Deri’s hypothesis appears substantiated, for girls scored significantly 

er. 

Open S. The results are lacking in consistency since delinquent girls when matched with 
delinquent boys, and the girl population when compared with the boy population, yielded 
higher scores. The findings might suggest that aggressive tendencies aren’t as “strong” in 
girls, and can therefore be more easily sublimated. 
Ambivalent S. Deri’s prediction is supported. 


The E Factor. Deri’s interpretation of this factor is highly questionable since none of the Chi- 
Squares were significant, and only one “?”’ appeared significant, perhaps by chance. 

The Hy Factor. 
+Hy. This factor didn’t appear significant in any of the compared groups. 
—Hy. The fact that boys always outscored the girls might imply that boys, as a group, are 
less demonstrative of their feelings. To be noted also, is the high score of non-delinquent girls 
when matched against delinquent girls. Better control (super-ego) as Deri asserts coul 
the rationale. 
Open Hy. Why Catholic boys outscored the public high school boys remains to be answered. 
Ambivalent Hy. The somewhat frequent high scores of the girls might indicate, as Deri 
asserts, their tendency for hobbies, mannerisms, etc. 
D Factor. 
+D. The results were inconsistent. Of interest is the fact that public high school girls 
when matched with Catholic high school girls scored significantly higher. This might imply 
that religious training has instilled in the latter a “will” to forego material objects and the 
consequent “comfort” derived from them. Hence the conclusion agrees with Deri, but the 
major premise differs markedly. Secondly, Deri identifies the +D individual “with anal 
characteristics.” She also asserts a “close link between the +S and the +D characteristics.” 
The present study fails to establish such a connection for the boys scored high in +S. 
-D. Deri equates —D with non-aggressive behavior. The present study contradicts this 
since boys gave higher scores. 
Open D and Ambivalent D. These yielded conflicting results, most of which failed to meet 
Deri’s predictions and assumptions. 
M Factor. 
+M. The delinquents failed to score high, as a group, in +M. Hence Deri’s predictions 
appear justified, for “‘+M is rather a counter-indication for anti-social behavior.” The fact 
that girls frequently score high in +™M can possibly be explained on the supposition that they 
are more sociable and mature earlier than boys. 
-M. The present findings contradict Deri’s assumption, for she suggests that anti-social 
groups would select —M. 
Open M. According to the present investigation open M is more apt to predict a delinquent 
than Deri’s assertion (-M), especially when the individual taking the test is an adolescent boy. 
Ambivalent M. No consistent pattern emerges. 


The P Factor. 

+P. The one significant finding could be due to chance. If not, idealism might be a reason- 
able supposition. 

-P. Deri maintained that —P would be found in anti-social groups and that —P is character- 
ized by projection implying a lack of conscious connection between daily activity and deeper 
underlying emotional needs. The implication of the present study, since the total delinquent 
population significantly outscored the non-delinquent population, as well as delinquent 
girls when compared with non-delinquent girls, is far reaching and Deri’s assumptions ap- 
pear to be confirmed. 


Open P. Probable confirmation is “ae since non-delinquent boys gave significant 
records when matched with delinquent boys 


Ambivalent P. Deri’s hypothesis is probably neither confirmed nor contradicted. 

The K Factor. 
+K. In part, the results could imply that girls are more likely to sublimate than boys. 
-K. The results are contrary to Deri’s contentions, for she stated “standards and value 
judgments are readily accepted.” This can hardly be true of delinquents. The remaining 


significant score of the boys is rather difficult to explain. Possibly it signifies that stronger 
‘Sd-demands”’ are “felt” and not “wanted.” 


Open K. The one significant occurrence is opposed to Deri’s prediction. 


Ambivalent K. Deri’s assertions appear to be verified since the total delinquent girl popula- 
tion yielded higher scores when matched with the total delinquent boy population, for here, 
anti-social activity does “not involve physical force.” 
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CoNCLUSIONS 


The Szondi Test was individually administered to 1,200 adolescents (600 boys 
and 600 girls) between the ages of thirteen and nineteen. This population was equal- 
ly divided into delinquents and non-delinquents. The latter group was further 
divided into public and Catholic high school students. The present study was under- 
taken to investigate by means of the Szondi: (1) Deri’s assumptions regarding de- 
linquency, (2) personality differences between delinquents and non-delinquents, 
(3) adolescent sex differences, and (4) the test’s ability to reflect religious training. 


Appropriate statistical procedures were employed to determine significant differ- 
ences. 


1. Deri’s prediction that delinquency would be indicated by —E and -Hy, 
and —M fails to receive confirmation. In fact, non-delinquent girls significantly out- 
scored delinquent girls in-Hy. The present study suggests that delinquent boys have 
a tendency for open M and —K; whereas delinquent girls are more apt to give higher 
+8 and —P choices. +M is, as Deri asserted, a counter-indication of delinquency. 
The present investigation confirmed this, especially if the adolescent is a girl. 


2. Regarding personality, delinquents seem to need more affection (+H), are 
less apt to recognize their daily needs and deeper underlying emotions (—P), and 
hence there is a tendency to compensate by oral activity (open M). Non-delinquents 
appear better adjusted socially (+M). 


3. Sex differences are clearly indicated. In the S factor most of Deri’s assump- 
tions receive confirmation, especially +S and -S. Deri’s supposition for the H 
factor is rather questionable. +H, contrary to her prediction, yielded significant 
results in the boy population. —H did not appear significant in any of the matched 
groups. The present investigation might confirm her assumptions for open H and 
ambivalent H. A consistent rationale of the other factor Tatian: Why boys score 
so high in —Hy, —D, open M, and —K, and why girls yielded significant scores in 
+D, ambivalent D, ambivalent M, +K, and ambivalent K based on Deri’s assump- 
tions is rather difficult to explain. 


4. The Szondi test failed, to any great degree, to reflect religious influence. 
A possible exception might be that Catholic boys are less overtly aggressive because 
of their training; the public high school boys have a higher +8. In the girl popula- 
tion, a trend emerges in the D factor but Deri’s hypothesis is a questionable one as 
far as the present investigation is concerned. 
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SOME VARIABLES RELATED TO OUTCOME OF PSYCHOTHERAPY 
JEROME K. MYERS AND FRANK AULD, JR.! 


Yale University 


PROBLEM 


The selection and validation of criteria to measure psychotherapeutic experience 
are important problems in current research. In an increasing number of studies the 
utility of psychological tests as guides to selection of suitable patients for psycho- 
therapy is being investigated“: ?» 3." 7), Although the validation of such tests rests, 
to a large degree, upon the adequate measurement of the success or failure of the 
patient’s therapeutic experience, insufficient research has been directed towards this 
area. Recently, considerable interest has been focused on the relationship between 
the social class backgrounds of patients and their experience in therapy “> ®). Again, 
the problem of rating such experience arises but has not been solved. 

In many studies length of treatment has been used as a measure of psycho- 
therapeutic experience. For example, Kotkov and Meadow? and Rogers, Knauss, 
and Hammond? utilized this criterion in their research on the selection of suitable 
patients for psychotherapy by psychological tests. Schaffer and Myers also selected 
duration of contact to rate therapeutic experience in their study of social class and 
treatment in a psychiatric out-patient clinic® *. In addition, the experience and 
amount of training of the therapist has been suggested as being related to psycho- 
therapeutic experience: 5 ®), 

Using length of time an individual stays in treatment and experience and 
training of the therapist as measures of therapeutic experience is based upon the 
common belief that psychotherapy requires relatively long and intense contact be- 
tween a well-trained and experienced therapist and patient. ». 307; 8 p. 55) Al- 
though a lengthy contact does not necessarily mean a successful one, chances are 
considered better than in a shorter therapy. Certainly, the probability of a brief 
treatment being successful is considered minimal. Chances for a successful therapy 
are also considered greater if the therapist is experienced and highly trained. Since 
the validity of these general assumptions is not established, more research is needed. 
What is the relation between the length and outcome of therapy? Is there a signifi- 
cant relationship between duration of contact and the success or failure of treat- 
ment? Is it justifiable to rate a psychotherapy by an experienced psychiatrist differ- 
ently from one conducted by a resident? 


PROCEDURE 


The present paper reports on an empirical investigation of the relationship be- 
tween the manner in which therapy is terminated and (1) length of treatment 
(measured by number of interviews) and (2) training and experience of the therapist. 
It is based on cases studied by Schaffer and Myers in their investigation of the role 
of social class in therapeutic experience in a psychiatric out-patient clinic © *». They 
studied all the cases that came to the institution, a training and community clinic 
where treatment is oriented around expressive psychotherapy, from October 1, 1950 
to September 30, 1951. The present investigation is limited to the patients seen by 
the senior staff and resident psychiatrists. Since medical students, the other large 
group of therapists, were serving a four-week clerkship, their patients were omitted 
because of the artificial time limits set on their length of therapy. 

The present authors examined the records of all patients seen during the vear 
by the senior staff and resident psychiatrists to determine the manner in which 


1From the departments of Sociology and Psychology, aided by the following research grants: 
“The Relationship of Social Structure to Psychiatric Disorders,” A. B. Hollingshead and F. C. Red- 
lich principal investigators, U. 8. P. H. S. Mental Health Act Grant MH 263, and “Development of 
Quantitative Methods for Detailed Study of Psychotherapy,” J. Dollard principal investigator, 
U.S. P. H. S. Mental Health Act Grant M-648. 
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therapy was terminated. Of the original 65 cases, two were omitted from the study. 
In one case the record could not be located, and in the other, the patient died during 
treatment. The manner in which therapy was terminated was not noted officially 
at any specific place in the record, but it was possible to determine it by examining 
the entire record. The following four categories of termination emerged from our 
analysis: 

.. Patient quit. Patient terminated therapy without the approval of the therapist. The 
therapist felt further therapy was required and advised against termination if consulted. In 
many cases, however, the patient sim Ba quit by failing to keep an appointment. 

2. Patient unimproved. The patient was discharged by the therapist as an unsuccessful 
case. The doctor felt the therapeutic relationship was unsatisfactory and that the patient was 
not able to profit from it. In essence, the therapist gave up on the case, even though the patient 
may have desired to continue. 

3. Patient improved. The treatment was terminated by the therapist on the grounds that 
the patient was sufficiently improved to require no further treatment at the time. 

4. Therapy continued. The patient continued in treatment elsewhere. He either trans- 


ferred to a private practitioner, because an increase in his income excluded him from the clinic, 
or he moved out of the community and entered treatment elsewhere. 


We did not make any check on the reliability of our classifications, since it would 
give a spurious impression of objectivity to study reliability of classification when 
we have no way of testing the reliability of the raw record. However, we offer the 
reader examples of statements in the record indicating differing types of termination: 

1. Patient quit. At the end of each case record there was a summary sheet about the date 
and nature of each appointment. If the patient quit it was listed here, under an entry such as 
“Patient called to break her appointment because she could not get time off work. She said she 
would call back for another appointment, but never did.” “Patient cancelled appointment by 
phone and never called back.” “Patient failed to keep his appointment.” 

2. Patient unimproved. Therapist terminated treatment for reasons like the following: 
“In view of the very difficult circumstances in which the patient lives, it seemed that psycho- 
therapy would not have much chance.” “. . . therapy discontinued on my initiative inasmuch as 
he seemed to have reached an impasse.” 


Patient improved. One therapist stated, “I feel she can handle her problems with con- 


siderably more freedom than when she came. She seems to have benefited from therapy.” An- 


other wrote, “Her symptoms were improved. I told her to call me in the future if she ever becomes 
upset.” 


4. Therapy continued. “Arrangements were made that I would see him i in private practice.” 
“Arrangements were made to have her seen in private practice by Dr. - - -.’ 


RESULTS 


The relationship between length of treatment and the manner in which it was 
terminated is shown in Table 1. It is clear that the fewer the number of interviews, 
the greater are the chances of failure. Seventy-nine per cent of patients seen fewer 
than 10 times either quit or were discharged by their doctors as unsuccessful cases, 
but only 15 per cent of those interviewed 20 or more times concluded their treatment 
in such a manner. Similarly, the percentage of patients discharged as improved in- 
creased with the duration of contact, jumping from 0 to 43 as the number of inter- 
views rose from fewer than 10 to 20 or more. 

The relationship between the training and experience of the therapist and treat- 
ment outcome is shown in Table 2. It is clear that the more experienced and better- 
trained senior staff therapists tended to have more successful terminations and fewer 
failures than the residents, when only cases seen ten or more times are considered; 
under this length of time training and experience are not related to outcome.? Thus, 
it is apparent that both length of therapy and training and experience of the thera- 
pist are related to the manner in which therapy is terminated. Brief therapies (under 
10 interviews) tend to end unsuccessfully regardless of the training and experience 
of the doctor. More lengthy treatment tends, in general, to be more successful, 
especially if more highly trained practitioners are involved. 


*The senior staff was composed of psychiatrists with the rank of instructor or assistant professor, 
who had several years of experience beyond their residency. The resident psychiatrists were in their 
second year of a three-year training program. 





SOME VARIABLES RELATED TO OUTCOME OF PSYCHOTHERAPY 


TaBLe 1. LENGTH AND OvuTCOME OF THERAPY* 








Number of interviews 





Manner in which 10-19 20 and over 
therapy was terminated Per Per 


No. Cent | No. Cent 
Patient quit 6 33 2 10 








Therapist discharged 
patient as unimproved 17 





Therapist discharged 
patient as improved 89 42.6 





Therapy continued 
elsewhere 5 21 2 11 9 42.6) 16 








Total 24 100 18 100 21 100 63 100 














*X? = 14.2 P<.001. In the computation of chi square the categories “10-19” and “20 and 
over” were combined, as were the categories “patient quit’? and “therapist discharged patient as 
unimproved.”’ The category “therapy continued elsewhere” was omitted. 


TABLE 2. EXPERIENCE OF THERAPIST AND OUTCOME OF TREATMENT 
(10 oR MORE INTERVIEWs)* 








Staff Resident 
Manner in which 


therapy was terminated Per 
No. Cent 


Patient quit 8 28.6 


Therapist discharged 
patient as unimproved 11 


Therapist discharged 
patient as improved 82 

















Therapy continued 
elsewhere 8 28.6 





Total ll 100 28 100 











*The exact test for association in a 2 x 2 table was employed and gave a p of .0483. In the com- 
putation the categories “patient quit”’ and “therapist discharged patient as unimproved” were com- 
bined, and the category “therapy continued elsewhere” was omitted. 


DIscuUssION 


Several crucial questions are raised by the results of this study. Most important, 
does the manner in which therapy is terminated measure success and failure? Ob- 
viously, the outcome reflects to a large extent the views of the therapist. If the pa- 
tient quits when the therapist feels further treatment is necessary or if the patient is 
discharged as unimproved, the case is a failure by the therapist’s definition. By the 
same token, the case is successfully terminated if the patient is discharged as im- 
proved. Whether or not objective criteria would indicate successful outcome for 
patients discharged as improved is not known. 

What is the role of individual differences among therapists in the definition of 
“success” and “failure’’? Unfortunately, we did not have enough cases for individ- 
ual therapists to investigate this. However, since there were 12 therapists in the 
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study, each of whom carried from 1 to 10 cases, the factor of individual differences 
should be somewhat averaged-out. However, the differences in the manner of term- 
ination between the senior staff and residents may be due to different conceptions 
of improvement based on differential experience and training. For example, are the 
senior staff more likely to discharge a patient as improved if the therapeutic process 
breaks down, rather than have the patient become disinterested and quit? Are the 
residents reluctant to discharge patients as improved? Or are the differences be- 
a the outcomes real ones, due to greater training and experience of the senior 
staff? 

There may also be differences between senior staff and residents in the selection 
of cases. For instance, it is possible that the senior staff on the basis of clinical ex- 
perience select cases more likely to be successful and leave the ‘‘poor” cases for the 


residents. Clearly, questions of this sort need to be investigated through empirical 
research. 


SUMMARY 


We have presented evidence to support the following two hypotheses: (1) 
Length of psychotherapy is related to the manner in which treatment is terminated, 
and (2) Experience and training of the therapist are related to treatment outcome. 
Specifically, we found that the longer the duration of contact and the greater the 
training and experience of the therapist, the higher the patient’s chance of being dis- 
charged from treatment as improved. Whether these results would hold for other 
clinies or other definitions of outcome of therapy is not known. This must be de- 
termined by further studies. 
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AFFECT IN THE ETIOLOGY AND MAINTAINENCE OF MENTAL 
DEFICIENCY?! 


MARION WHITE MC PHERSON AND RALPH I. FISCH 
The Neuropsychiatric Clinic of St. Louis U. S. Army, Camp Crowder 


PROBLEM 


Mental deficiency is one area of behavior in which a priori assumptions regard- 
ing etiology have been unusually potent in limiting psychologists’ activities. Psy- 
chology has for the most part abandoned its unique subject matter and simply 
borrowed genetic constructs to “explain” mental deficiency. The present writers are 
impressed with the possibility that mental defectives, since they perform psycho- 
logically, are subject to the principles determining all psychological behavior. The 
present study is an effort to discover if intellectually defective behavior per se sug- 
gests the presence of influential psychological factors. This aim was carried out by 
selecting one particular type of response and examining its characteristics, incidence, 
and possible etiological sources. 

Responses in which the content of a reply is only tangentially related to a diffi- 
cult question were selected because they are appropriate to the problem and because 
their systematic investigation has been neglected. An illustration of such behavior 
was provided by a mental defective during a Stanford-Binet examination. When 
instructed to repeat, ““At the summer camp the children get up early in the morning 
to go swimming,” the subject responded, ‘‘No they don’t. They get up to have their 
breakfast.”’ References to such behavior have been noted only casually in descrip- 
tions of mentally defective individuals and diverse labels such as aberrant“) and 
negativistic “®) have been applied. In order to delimit this particular type of be- 
havior the descriptive word evasion has been adopted and will be used throughout 
the remainder of the paper. 


METHOD 


In order to estimate the frequency of evasive behavior at disparate chronologi- 
cal and intellectual levels, 30 subjects were selected from each of three IQ and CA 
groups. The ranges specified in Table 1, which presents the identifying data for the 
subjects, were selected to provide one comparison of discrete IQ levels and one of 
discrete CA levels at ranges which avoid the difficulties encountered when CA cor- 
rections in IQ computation become essential. 


TABLE 1. IDENTIFYING Data FoR ALL SuBJECTS 








Chronological Age Intelligence Quotient 





Mean Range Range 





10-2 90 to 10-11 ; 75 to 85 
10-2 90 to 10-11 : 55 to 65 
13-1 120 to 13-11 4 55 to 65 




















The subjects were chosen from three public institutions caring for mental de- 
fectives. All were free of severe sensory handicap and diagnosed cerebral pathology. 


1When the data for this paper were collected the authors were at Wayne University, Detroit 
Michigan. Appreciation is expressed to Robert H. Haskell, M. D., Medical Superintendent, and 
Thorlief G. Hegge, Ph.D., Director of Education and Research, Wayne County Training School 
Northville, Michigan; to Edwin J. Rennell, M. D., Superintendent, Coldwater State Home and 
Training School, Coldwater, Michigan; and, to Adolph J. Rehn, M. D., Superintendent, Lapeer State 
Home and Training School, Lapeer, Michigan. Special thanks are due to Mrs. Z. Pauline Hoakley, 
Chairman of the Department of Psychology, Wayne County Training School, for her assistance in 
the selection and scheduling of subjects. 
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There are no known biases other than age and IQ in the final sample which con- 
tained 53 males and 37 females. In the majority of cases an IQ had been determined 
within six months of the present investigation. Since MA increases could be ex- 
pected within this interval it was impossible to estimate accurately the MA at the 
time of observation. Inspection of Table 1 reveals that the most probable MA for 
Groups A and C is seven years and eleven months and for Group B, six years and 
two months. 

The lack of procedural precedents for this type of investigation made difficult 
the choice of an observational situation. The Wechsler-Bellevue Intelligence Scale, 
Form I, was selected as an instrument that would present both the desired difficulty 
and permit some success“: *. 4 5 7,% 1). In order to assure a constant number of 
responses any items that might be within the success level of a few persons in the 
sample were given to all subjects. Each item was administered according to standard 
clinical procedure®®. The junior author, who administered all the tests, did not 
know the IQ or CA of a particular subject at the time of testing. 

All incorrect responses were inspected for evidences of evasion. This determina- 
tion was handicapped in some instances by the nature of the test material, in others 
by the failure during the test administration to record any but verbal responses, and 
in others by the nature of the subjects’ responses.? The following items permitted 
a face value identification of evasion and are the only ones included in the present 
paper: (1) Information, questions 0 through 6, 10, 13, and 16; (2) Comprehension, 
items 1, 2, 3, 5, 6, and 7; (3) Arithmetic, questions 1 through 5; and, (4) Similarities, 
questions 1 through 12. Of these items which provide 33 tasks for 90 subjects or 
2,970 requests, 1942 were attempted but failed responses. Of these failures, 617 or 
31% were replies in which the subjects’ verbal responses indicated evasion. These 
data were contributed by 85 of the 90 subjects. The following is a presentation of 
the characteristics of those evasive replies: 

I. Allied Associations. Responses in which the subject fails to achieve success because of an 
answer that is only tangentially related to the question. The relationship between the question 


and answer appears to be derived from contacts with externality that have been frequent and 
contiguous although not necessarily logical. 


A. Unit changes. Answers which include a concept that is qualitatively different from the 
units of the question. 

1. Replies that contain units different from those enumerated in the question. E.g., 
Info.—6: ““How many weeks are there in a year?” “365 days.” 

2. mses outside of a specified population: E.g., Info.—11: “Who was President 

before him?” “Governor Williams.” 
3. References to a specified place or logation in answer to a question beginning with 
the word what. E.g., Info.—3: ““‘What does rubber come from?” “Out of the ground.’» 


B. Descriptions: Replies that reiterate various aspects of the testing situation. 
1. Answers that elaborate the question: The words of the response, which may be any 


number, have a root form common to those of the question. E.g., Comp.—5: “Why 
should we keep away from bad company?” “Bad company take away.” 


2. Replies which embody only one aspect of the question and the aspect is specifically 
mentioned. This type of answer is exclusive to the Similarities subtest. E.g., Sim.—4: 
“In ,what way are a wagon and a bicycle the same?” “Bicycle for something to ride 
on. 

3. Responses that are elaborations of the question but do not contain words that have 
a root form in common with the words of the question. E.g., Comp.—5: ‘Why are 
shoes made of leather?” “You put them on.” 


C. Repetitions: Answers that are characterized by the unelaborated reiteration of individ- 
ual words or parts of the question. E.g., Comp.—6: ‘‘Why does land in the city cost more 
than land in the country?” ‘Land in country.” 


D. Non-repetition. Responses which are tangentially related to the question, do not exceed 
three words, and in which the words are not those used in the question or do not have a 
root form in common with them. E.g., Comp.—21: “What should you do if while sitting 
in the movies you were the first person to discover a fire?” “Matches.” 


*The details of these difficulties and suggested modifications are presented by Fisch‘. 
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II. Irrelevant Associations. Responses in which the entire answer or part of it is apparentl 
unrelated to the question. The content of the reply does not have any logical connection wi 
the question and the likelihood of frequent and contiguous reality contacts is improbable. 


A. References to supernatural persons, places, or conditions when the question has a space 
and time frame of reference. E.g., Tato .—0: “Who is the President of the United States?” 
“Jesus.” Supernatural references that were otherwise non-evasive in response to ques- 
tions containing such stimulus words as “heart,” “statue,” “praise,” and “punishment” 
were not considered to exemplify irrelevant associations in as much as contiguous exper- 
iences have been probable. 


B. Replies that include colleagues and or the family rather than individuals from a popu- 


ulation appropriate to the question. E.g., Info.—1: “Who was President before him?” 
“My mother.” 


C. Responses that are extraneous and unique: These replies do not reiterate parts of the 
uestion and have no observable or inferential relationship to the total question. E.g., 
- rene —6: “Why does land in the city cost more than land in the country?” “Girls.” 


III. Perseveration. Responses in which the subject answers the question by duplicating a res- 
ponse previously given. The replies do not have any logical connection with the question and 
the likelihood of frequent and contiguous reality contacts is improbable. 


A. Replies that are exact reproductions of an entire answer in two consecutive questions 
or in two questions separated by only one intervening question-answer sequence. Persev- 
eration was not counted until the second identical response was given. E.g., Info.—0 & 1: 
“Who is the President of the United States?” ‘George Washington.” “Who was President 
before him?” “Geeorge Washington.” 


B. Responses which involve any particular category such as color or physical character- 
istics in a minimum of three consecutive questions or in three questions separated by only 
one intervening question-answer sequence. Perseveration was not counted until the third 
of these similar responses was given. E.g., Sim.—10, 11, & 12: “In what way are a poem 
and a statue the same?” “Yellow.” “In what way are praise and punishment the same?” 
“Orange.” “In what way are a fly and a tree the same?” “Blue.” 


IV. Negation. Responses in which the subject does not attempt to answer the question’s prim- 
ary thesis but denies its intent or hypothesis. 


A. Denial of Similarity. This type of response is confined to the Similarities subtest. The 
subject sets up a condition and negates it. E.g., Sim.—3: “In what way are a dog anda 
lion the same?” “Dog has a tail, lion doesn’t.” 


B. Direct denial. These are replies in which the subject denies the assumptions of the 
question. E.g., Info.—6: “How many weeks are there in a year?” “None.” 


RESULTS 


Table 2 presents the number of failures in each subtest for each group of sub- 


jects and of these, the percentages that were “no responses,” “don’t know’’ replies, 
evasive answers, and all other types of failures combined. Figure 1 depicts the num- 
ber of evasive responses given by each group of subjects to each subtest. Inspection 
of these reveals that Groups A and C are similar but that the individuals in Group B 
failed to actualize the key part of a question much more frequently than the sub- 
jects in the other two groups. The increased frequency was so consistent that the 
customary tests of significance were not necessary. 


TABLE 2. THe NUMBER OF FAILURES AND THE PERCENTAGES OF Various Ways OF FAILING 








Arithmetic 


Groups 
B 


Comprehension 
Groups 


Information 
Groups 
A B 


Similarities 


Groups 
A B 





N of failures 


Don’t know 
No responses 
Evasive 

All other types 





131 144 


3% 
11% 
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SUBTEST 


Fiaure 1. FREQUENCY oF EvastIvE RESPONSES 


Table 3 lists the percentage frequencies with which the three groups of subjects 
exhibited the categorized types of evasion. It indicates that the least adequate sub- 
jects distinguish themselves from the others by giving more allied repetitions (I-C), 
more irrelevant associations (II), fewer allied descriptions (I-B), and fewer nega- 
tivistic denials (IV-A). 


TABLE 3. THe PERCENTAGES OF VARIOUS TYPES OF 
EvASIVE RESPONSES 
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DIscussIoN 

Conventionally this described deficit, consistently more acute in the least ade- 
quate subjects, would be interpreted etiologically as one manifestation of either 
genetic or anatomical and physiological insufficiency. The fact that the subjects 
in Group B give more repetitions and more irrelevant responses suggests that their 
comprehension may be so much more deficient than the other subjects that they 
have relatively little insight into their inadequacies and simply give any response. 
Such lack of insight is not a unique contribution of this paper but merely an illustra- 
tion of a characteristic of defectives noted since the time of Binet @). 

This conventional genetic interpretation is threatened by the variations among 
the three groups of subjects in the frequency of certain kinds of responses. These 
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do not substantiate an hypothesis of genetic deprivation. In order to account for 
these data and to pursue a theoretical orientation that identifies psychological 
(experiential) factors as influential contributors to psychological events, attention is 
directed to those antecedents which are believed to stimulate evasion. It has been 
observed that punishment of overt hostility tends to reduce the overtly hostile 
response and to promote “surface”? compliance without eliminating a covert atti- 
tude of negativism. Evasive responses could represent such a ‘“‘surface’”’ compliance 
with a concomitant covert negativism. 

It is conceivable that mental defectives have had such a high and /or traumatic 
number of punitive episodes that they maintain a somewhat generalized reduction of 
interest in meeting cultural demands with a resultant intellectual inefficiency. The 
incidence of evasion among the total sample can be interpreted as evidence that 
mental defectives have been subjected to pressure to respond. If this evasion conceals 
covert negativism, a condition exists that would help these subjects to be less in- 
formed about their culture, i. e., have lower MA’s. In this interpretative frame of 
reference the mental defective is exhibiting the neurotic paradox: he wants to be 
adequate but maintains an attitude that hinders rather than promotes this process. 

The fact that a higher frequency of evasion is concomitant with greater clinical 
pathology supports a corollary of the above statement: the least adequate subjects 
have experienced the most pressure. The trend for the subjects in Group B to reduce 
the frequency of ‘‘don’t know”’ responses (need to respond?), to repeat the question 
more often (echolalia?), to give fewer allied descriptions (fewer relevant responses?), 
and to give more irrelevant replies (over-incorporativeness?) would from this point of 
view reflect an accentuated attempt to reply. These responses are accompanied in 
Group B by a markedly reduced frequency in the denial of the assumptions of a valid 
question. This peculiar way of responding to the Similarities subtest as well as the 
giving of fewer allied descriptions would not be expected if only an internal de- 
ficiency manifest as poor comprehension and deficient insight were operative. This 
failure to make an attack upon the request that is more direct than any of the other 
categorized ways of evading suggests that the least adequate subjects are less free to 
criticize then members of the other two groups. That is, the lower MA’s in Group B 
reflect an intensification of paradoxical behavior. 

The evasive responses on which this discussion is based were elicited largely in 
the Similarities subtest (IV-A) and from a small n. Whatever the limitations of the 
sampling the differential between Group A and C on one hand and Group B on the 
other remains. The fact that 85 of the 90 subjects contributed evasive responses in- 
dicates that the results are not the product of the idiosyncratic behavior of a few 
individuals. The fact that the identifying data of the subjects were unknown to the 
examiner increases the probability that the responses are not distorted by personal 
preferences. 

The group variations permit a focussing of a critical area for further research 
within the subject matter of psychology. The present incidence of evasion in 31% of 
the attempted but failed responses makes it apparent that covert rebellion is only 
one variable in mental deficiency. Extension of research to other attitudes and to 
historical episodes might foster multiple causation in one of the few areas of psycho- 
pathology in which generally preferred etiological multiplicity has been largely 
ignored. 


SUMMARY 


This study was designed to determine the characteristics, incidence, and possible 
etiological sources of mental defectives’ evasive attempts to respond to difficult 
items on the Wechsler-Bellevue Scale, Form I. Such replies were given by 85 of 90 
subjects and characterized 31% of their attempted but failed responses. The experi- 
mental design permitted two comparisons of groups with lower and higher MA’s. In 
both comparisons the children with the lower MA’s evidenced evasion consistently 
more frequently than did the subjects with the higher MA’s. An endeavor was made 
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to identify possible antecedents of such behavior and there is evidence to suggest 
that the phenomenon might reflect an attitude of learned negativism. The relatively 
low incidence of evasive responses suggests that experiential antecedents of this 
type will elucidate only one etiological aspect of mental deficiency. 
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A STUDY OF THE APPLICABILITY OF THE H-T-P TO CHILDREN WITH 
RESPECT TO THE DRAWN HOUSE* 


HARRY 8S. BECK 
Illinois Department of Public Instruction 


INTRODUCTION 


In working with mentally handicapped children, one frequently finds that they 
omit what may be considered as essential details in their drawings. Buck “? indicates 
that the omission of essential details may be indicative of intellectual deterioration 
in one known to have been of normal intelligence. The question that arises at this 
point is whether or not essential details, as defined by Buck for adults, have the same 
significance in the case of children, and if so is his interpretation valid for children. 
Very little has been done by way of standardizing or validating the H-T-P with 
children. Jolles® * and Jolles and Beck “: ») did a series of studies in this area which 
to date appear to be the only studies using fairly large samples of a school population. 
The present study was attempted as a means of furnishing further objective evi- 
dence regarding the applicability of Buck’s interpretations of H-T-P drawings when 
applied to children. 

The problems to be investigated are: (1) Does Buck’s definition as to what 
constitutes essential details apply to children? (2) Does Buck’s interpretation of the 
omission of essential details apply to children? (3) Do mentally handicapped organic 
children differ in their drawings from mentally handicapped non-organic children? 
As indicated by the title, the present investigation is limited to the drawing of a 
house rather than the House, Tree, and Person together. 


*The writer wishes to acknowledge his appreciation to Mr. Isaac Jolles for the use of the drawings 
collected for an earlier study. 
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THE SUBJECTS 


The normal population, or control group, was composed of 212 five year old 
boys and girls and 593 six year old boys and girls. Only five and six year old children 
were used in the control group, because it was felt that if children at this age drew 
houses that were recognizable and complete in the matter of essential details, normal 
children of older ages would do as well or better. The control group population is 
the same as described and used by Jolles®: *) except that it includes only the five 
and six year olds. Inasmuch as most of the sample was obtained from school systems 
having Special Education facilities, it is likely that the control group has less than 
the expected 14% to 2% mentally handicapped, since these children would be en- 
rolled in Special Education classes which were not included in the sample. The or- 
ganic group was composed of twenty-five boys and girls, ranging in age from six to 
nineteen years. The term “organic’’ as used here is defined as: ‘any involvement of 
the central nervous system resulting from disease, injury, or malformation occurring 
before, during, or after birth.”” This group was diagnosed as such by means of psy- 
chological and neurological examinations, and in some cases E. E. G.’s were obtained. 

The non-organic mentally handicapped group was composed of thirteen boys 
and girls who were diagnosed as mentally handicapped by means of psychological 
examinations, but from whom no organic signs could be elicited on neurological ex- 
amination. Table 1 indicates the makeup of the experimental groups. 


TaBLE 1. Mean Aags, IQ’s, aNp STANDARD DEVIATIONS OF THE EXPERIMENTAL GROUPS 








Group N Sex 


Mean Age 8. D. 
M F 


Mean IQ 8. D. 





Organic 25 20 5 11-2 


2.98 yr. 


56.77 


8.12 





Non-organic 





13 





8 





9-8 





2.95 yr. 


69.48 


8.24 











PROCEDURE 


In spite of the fact that the experimental groups are older than the control 
group chronologically, the mean mental ages are approximately six years as deter- 
mined from the mean chronological ages and mean IQ’s. This would make them 
roughly equivalent to the six year old section of the control group, since the mean 
mental age of this group should be approximately six years also, if it is a normal 
population as we have assumed it to be. 

The drawings were rated for the following things: was it recognizable as a house, 
was it recognizable but unrealistic or bizarre in appearance, was it unrecognizable, 
were all essential details present, i.e., wall, roof, door, window, and chimney; was one 
essential detail missing, were two essential details missing, were three or more 
essential details missing, or were irrelevant details present. 

The control group was then compared with the organic group on the basis of 
these criteria using chi square. The reason for using the organic group was that the 
writer was interested in this particular group in the beginning and the non-organic 
group was brought in later. A later comparison of the two experimental groups using 
chi square indicated that there were no significant differences between them. In 
fact, the differences between the organic and non-organic mentally handicapped 
groups was so slight as to be considered entirely chance. 


RESULTS 


Table 2 refers to the recognizability and bizarreness of the drawings and indi- 
cates the obtained frequencies, with the expected frequencies being in parentheses. 
It will be seen that organics have a significantly higher number of unrecognizable 
houses than either five or six year olds and that a significantly smaller number of 
organics draw houses that are both recognizable and lacking in bizarreness. It should 
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be noted, however, that the latter probably results from the large number of unrecog- 


nizable houses as the organics obtained the expected frequency on the “recognizable 
but bizarre” item. 


TaBLE 2. RECOGNIZABILITY OF DRAWINGS OF NORMALS AND ORGANICS 








Recogniz- Unrecogniz- Recogniz- 
Group f able but able able and 
Bizarre not Bizarre 


5 year olds 62 5.0 145.0 
(54.70) (3.82) (153.49) 


6 year olds 406 145 2.0* 446.0 
(152.99) (10.67) (429.33) 


Organics 20 7.0 8.0" | 10.0* 




















(6 .45) (0.45) (18.10) 
Totals 533 214.0 15.0 601.0 


























*Indicates significant differences beyond .01 level. Chi Square = 140.26 

With regard to inclusion of irrelevant details there were no significant differ- 
ences between the groups. Table 3 reports the number of essential details omitted 
with the expected frequencies in parentheses. Five year old boys and girls and 
organics differ significantly from six year olds in all categories except that the five 
year olds have the expected number of obtained frequencies in the “two details 
missing’”’ column, as do the six year olds. 


TaBLeE 3. Essentrats Detarts OMITTED BY NORMALS AND ORGANICS 








All Essential 3 or More 
Sex Details 1 Detail 2 Details Details 
Group M F Present Missing Missing Missing 





5 year olds 105 | 146* 50* 
(165.75) (33. 15) (7. 55) 





6 year olds 406 | 494" 75* mi 4 5* 
(464. 10) (92. 84) (21.44) | (14.28) 
| 
| 





Organics 20 é 10* 5* 
(19.50) (3.90) €: 00) 
Totals 533 297 650 130 30 


* 


(.60) 
20 




















\ 





*Indicates significant differences beyond .01 level Chi Square = 102.36 


DISCUSSION 


It would seem from these findings that Buck’s definition of what constitutes 
essential details is applicable to children of six years of age and above but not to 
children below six years. Ideally, however, a study should be made to determine 
whether or not normals will deviate from these norms somewhere between six years 
and adulthood as a result of developmental factors. The addition of irrelevant de- 
tails apparently is a chance factor associated with the individual’s need to structure 
the situation. Since the two experimental groups did not differ significantly, the 
absence of essential details or unrecognizability in the drawing of a house could not 
be used as a means of differential diagnosis with mentally handicapped individuals. 
It might, however, be useful in providing clues to mental deficit or organicity with 
individuals bey ond the age of six years. It would seem that an unrecognizable house, 
or one with three or more essential details missing in a child of six years and over, 
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and of normal intelligence would at least be suspected of organicity. The fact that 
there were no demonstrable differences in the experimental groups should not be 
taken as conclusive evidence, since the non-organic group was particularly small. 
Further study of these groups is indicated with larger samples and with more inclusive 
medical test batteries, especially with the non-organic group. It would have been 
desirable to have had E.E.G.’s on all the non-organic group, but this was not possible 
for various reasons. 


CONCLUSIONS 


1. Buck’s definition of what constitutes essential details in a house is applic- 
able to six year old children in Illinois schools. 


2. Buck’s interpretation of the omission of essential details is also applicable 
to this group. 


3. Mentally handicapped organic and non-organic children seemingly do not 
differ in their drawings of a house. 

4. The inclusion of details in the drawing of a house is, to some extent, a de- 
velopmental process which seemingly matures at about the age of six years insofar 
as essential details are concerned, and therefore, great caution should be used in 
making any interpretations with children below the age of six years. 
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A STUDY OF RORSCHACH FLEXOR AND EXTENSOR HUMAN 
MOVEMENT RESPONSES! 


EMANUEL F. HAMMER AND IRVING JACKS 


New York State Psychiatric Institute 


INTRODUCTION 


Perhaps the most fruitful, as well as deeply tapping, aspect of the Rorschach 
procedure resides in the human movement responses. This is particularly true if the 
responses are accompanied by an actual experience of movement expressed in the 
concept of a displacement of blot area. Piotrowski“ argues that the feeling that all 
or some parts of the ink blot are in the process of changing their relative positions is 
a fundamental requirement for regarding the percept as a true M response. When 
subjects speak of human movement and do not experience this kinesthetic sensation, 
these responses are not primary human movement phenomena but a secondary, or 
intellectually inferred, description. Rorschach“? himself strongly insisted on the 


1Grateful acknowledgement is made to Drs. Bernard C. Glueck, Jr., and Zygmunt A. Piotrowski 
or their helpful suggestions and constant encouragement. 
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distinction between the true experience of movement and the spurious, merely 
verbal, talk about movement. The present study is concerned only with the prim- 
ary M. 

Rorschach’s experience ®?, as well as that of Piotrowski® *®, Beck: *), and 
Klopfer and Kelley“, indicate that the human movement percepts stand in the 
closest relationship to the unconscious and hence provide the broadest percept- 
analytic bridge over which data from the unconscious may pass. 

The type of M response, it is maintained, tends to reveal one’s characterological- 
ly-anchored conception of one’s role in life“: 5 ©. It conveys a deep-rooted tendency, 
and one which is not easily modified, to repeatedly assume the same attitudes in 
dealing with people, in the areas that the subject feels are personally important. The 
type of M reflects the subject’s established role and, as a corollary, suggests conflict 
leading to anxiety when he is required to act in a manner not compatible with that 
type of role. 

The most rewarding conventional breakdown of types of M thus far evolved is 
into the categories of flexor, extensor and blocked M. The extensor M is associated 
with self-assertiveness and a desire to leave the imprint of one’s personality upon 
others. The flexor M is related to compliance, passivity, dependency and lack of 
assertion. Blocked movement responses have been taken to reflect indecisiveness 
and a too-ready tendency to remain immersed in obsessive doubts: 5 ®), 

In the research project? involving a study of different types of sex offenders, 
the writers felt that they had available a unique opportunity to test out these em- 
pirical hypotheses on two groups who represent opposite ends of the continuum 
which runs from over-assertion to crippling lack of assertion. A group of rapists 
typify the position on the former end of the continuum whereas the other end is 
represented by a group of pedophiles, those whose offense consisted of a passive, 
inadequate and seductive appeal to children involving seeing, touching or mani- 
pulating of the child’s genitals. Clinically these pedophiles shape up as a passive, 
helpless, dependent group of individuals, sharply in contrast to the much more 
assertive rapists. In short, the two divergent groups appear to be tailor-made for a 
validity investigation of the hypothesis behind the flexor and extensor differentiation 
of the human movement response. 


SUBJECTS 


One hundred and six men imprisoned in Sing Sing Prison for an illegal overt 
sexual act comprised the subject population. Of the total group, 43 were rapists of 
adult females whose minimum age was 18 (Mean age 31), 40 had one or more sexual 
offenses involving a female child below the age of 15 (Mean age 9), and 22 were in- 
carcerated for a sexual act with a male child below the age of 15 (Mean age 8). The 
remaining case was one of mixed pedophilia where there was a history of sexual ap- 
proaches to children of both sexes. In each case the comparison was made between 
the rapist group and either the heterosexual pedophiles, the homosexual pedophiles, 
or the total pedophile group. 

Cases of statutory rape were excluded from the rapist group; only those in- 
dividuals who had perpetrated an act of assertive aggression or force were included 
because of this requirement for their serving as a representation of the type of in- 
dividual expected to be an extensor M person. 


PROCEDURE 


Since the overwhelming majority of the sex offenders studied offer only one 
human movement response and that to Card III, the writers restricted the present 


2The present paper is a product of the Research Project set up under the auspices of the New York 
State Commissioners of Correction and Mental Hygiene upon the recommendation of Governor 
Dewey. The purpose of the Research Project is to uncover the underlying psychodynamic causes of 
“sex crimes” and to ascertain the treatability of sex offenders. This task was undertaken at the New 
York State Psychiatric Institute, and is under the directorship of Bernard C. Glueck, Jr., M. D 
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initial exploratory investigation to a consideration of the movement responses on 
this card alone. In this manner the problem of varying numbers of M responses from 
different subjects was circumvented. ° 

For the purpose of the present investigation, we have accepted the definition of 
a flexor M as any in which the action is toward the center of the human’s center of 
gravity, such as “bending,” “bowing,” “lowering his hand,” and so on. Extensor 
movements, conversely, include “straightening up,” “walking or running,” “danc- 
ing” and “jumping up to shoot a basket.’”’ Blocked movement was scored when the 
response included a person expending considerable energy with no, or relatively little, 
movement resulting. Examples are ‘‘men pulling in opposite directions,” “people 
pushing against each other” and ‘‘a man balancing himself.” 

Each subject’s Rorschach response was then placed in one of five categories: 
(a) Extensor M, (b) Blend of flexor and extensor, or two different types of M offered 
in more than one M response on Card III, or blocked M, (c) Flexor M, (d) Human 
seen but no movement projected, and (e) No human seen. The first three of these 
categories were given an assigned weight of one, two and three, respectively, thus 
enabling extensor-flexor Means to be tabulated for each of the sub-groups of offend- 
ers. 


RESULTS 

In Table 2, the higher the mean rating, the closer to the flexor end of the scale 
are the responses. The mean ratings are shown for the rapists, heterosexual pedo- 
philes, homosexual pedophiles, and pedophiles combined. 

The rapists show a clear tendency to respond, on the average, with significantly 
greater extensor emphasis.* This is true in comparison with both heterosexual and 
homosexual pedophiles. The clinical observations of rapists and pedophiles are in 
ty with these findings and the hypothesized interpretation of flexor and ex- 
tensor M. 

Of interest was the lack of significant differentiation between heterosexual and 
homosexual pedophiles, with respect to the projection of flexor and extensor M on 
Card III (t=0.76). This would suggest that lowered assertiveness is essentially 
equal for both groups. The motivation for the choice of the sex of the victims thus 
remains to be differentiated on other grounds. 

Turning to the question of humans perceived, it can be seen from Table 3 that 
significantly fewer rapists than pedophiles saw the “popular” humans. This may be 
explained as merely a reflection of the greater hostility of the rapists or else it may 
be viewed as in accord with Glueck’s®? position that it is the rapist’s concept of a 
“dehumanized” approach to sex-figures which allows him to rape. The act of rape 
may then be regarded as being in the service of a type of depersonalized gratification 
in which it is merely the vagina and not the entire person which the rapist cathexes. 


TaBLE 1. BREAKDOWN OF M Responses oF Rapists AND PEDOPHILES 








Heter. Homo. 
Response Rapists Pedo- Pedo- Total 
philes philes Pedophiles 


Extensor M 29 9 9 18 
Blend, or blocked M 3 10 1 11 
Flexor M ll 22 12 33 
Human seen but no 

movement 13 2 2 4 
No human seen 24 10 6 16 




















‘The significant difference might perhaps be even more striking if not for two factors: (a) M repre- 
sents an action-tendency which the individual] strives to realize, but he might frequently not be in a 
position to enable him to do so and in such instances may have to substitute a less comfortable role as 
an alternative. (b) The psychosexual area certainly is a specific one which responds to multi-deter- 
mined inner patterns which act to contaminate the separation of any single one of them“). 





EMANUEL F. HAMMER AND IRVING JACKS 


TasLe 2. Comparison oF Rapists AND PEpDOPHILES RELATING TO TYPE OF 
MoveMENT Seen: Extensor; BLEND oF FLEXOR AND EXTENSOR; FLEXOR 
(Ratep 1, 2, 3, RespecTiIvELy) 








Groups N 8.D. ae 
Rapists 43 : .87 





Heterosexua 
Pedophiles 40 ‘ 81 


Homosexual 
Pedophiles 22 2.14 .99 

















Total Pedophiles 63* 2.25 .87 


*Includes one case of mixed pedophilia 
**Comparison between Rapists and other groups 





TABLE 3. Proportions oF Sex OrreNDER SuBGROUPS FAILING TO SEE 
HuMAN on Carp III 








Subgroup N Proportion 





Rapist 62 .39 


Heterosexual 
Pedophiles 51 .20 


Homosexual 
Pedophiles 34 .18 
Total Pedophiles 86* 19 


*Includes one case of mixed pedophilia 
**Comparison between rapists and other groups 

















Discussion 


One consideration might be raised, although left unanswered for the present, 
and that is the question of validity of projective tests given after commission, de- 
tection and incarceration for an offense. A basic assumption of the present investiga- 
tion, then, is that the personality traits mirrored in the Rorschach M responses do 
not undergo modification significant enough to distort the present conclusions. The 
M responses are, as a matter of fact, the Rorschach components which resist change 
most vigorously “?. 


SUMMARY AND CONCLUSIONS 


The literature“. © contains statements concerning extensor M as a reflection 
of the assertive personality type and flexor M as a product of the submissive and 
passive personality. The investigation of the M responses to Rorschach Card III 
of a group of assertively forceful rapists and a group of passive pedophiles whose sub- 
missive orientation is evident in the nature of their offenses afforded an opportunity 
for the validation of this Rorschach hypothesis. The subject population consisted 
of 106 offenders at Sing Sing Prison, 43 of whom were incarcerated for an act of rape 
against an adult female and the remaining 63 for a sexual appeal to a child between 
the ages of 4 and 15. 

The sharp contrast between the characterologically-rooted divergent orienta- 
tions of these two groups on the variable of assertion-submission produced statis- 
tically significant differences in the type of M projected. This data support the 
hypothesis of extensor M being associated with assertive behavioral orientations and 
flexor M with submissive or passive ones.‘ 


‘These conclusions, however, are restricted to M responses to Card III, the card investigated in 
the present study. 
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A COMPARISON OF H-T-P’S OF RAPISTS AND PEDOPHILES: 
III. THE “DEAD” TREE AS AN INDEX OF PSYCHOPATHOLOGY 


EMANUAL F. HAMMER 


Research Project, Psychiatric Institute 
New York, New York 


INTRODUCTION 


In the study of sex offenders! at Sing Sing Prison, the subjects customarily re- 
ceive an extensive psychological examination as part of their clinical appraisal. 
Since the self-concept and body image are integral factors in motivational dynamics, 
and projective drawings seem especially suited to tap these personality components, 
a series of studies focusing on the H-T-P’s of these men was planned. As a first step 
in the project, the relatively objective factors of the ages attributed to the drawn 
Tree, the drawn male, and female Person, respectively, were chosen. A previous 
paper “) reported the results of this investigation. Support was found for Buck’s“: »? 
hypothesis that the age the subject assigns his drawn Tree may serve as an index of 
his feelings of psycho-sexual maturity or immaturity. 

Clinical experience suggests that it is easier for a subject to attribute more con- 
flicted or emotionally disturbing negative traits and attitudes to the drawn Tree 
than to the drawn Person because the former is “less close to home”’ as a self-portrait. 
The deeper or more forbidden feelings can more readily be projected onto the Tree 
than onto the Person, with less fear of revealing oneself and less need for ego-defensive 
maneuvers. One may, for instance, more readily and unwittingly portray one’s 
feeling of emotional trauma by scarring the drawn Tree’s trunk and truncating its 
branches then by a parallel mutilation of the drawn Person’s face and body and 
similar distortion of the drawn Person’s arms. Buck“: ®) cites the occurrence of 
“dead” Trees as an index of significant and rather serious psychopathology. The 
present investigation concerns itself with a study of the projection of “deadness”’ 
onto the drawn Tree. 


SUBJECTS 


The study group included 84 males at Sing Sing Prison. All of them were im- 
prisoned for an illegal overt sexual act or an illegal act with sexual motivations or 
components. Of the total group, 31 were incarcerated for an overt forceful sexual 


1This is part of a study set up under the auspices of the New York State Commissioners of Cor- 
rection and Mental Hygiene upon the recommendation of Governor Dewey. The purpose of the Re- 
search Project is to try to uncover the underlying dynamic causes of “‘sex crimes” and to ascertain the 
treatability of sex offenders. This task was undertaken at the New York State Psychiatric Institute 
and is under the directorship of Bernard C. Glueck, Jr., M. D. Grateful acknowledgment is made to 
John N. Buck, Bernard C. Glueck, Jr., Zygmunt A. Piotrowski and Selma Landisberg for their en- 
couragement and fruitful comments. 
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approach toward an adult female, 33 for a sexual approach toward a female child 
between the ages of 4 and 15, and 20 for a sexual approach toward a male child be- 
tween the ages of 5 and 15. 

Throughout this study, the reader must remember that the subjects are a 
sampling of individuals apprehended for sex offenses. The distortion in the sample 
brought about by this selective factor can only be conjectured. Apprehended sex 
offenders do not differ from the community at large in regard to 1Q. Their average 
1Q is 101.6. The question remains in regard to motivational and personality factors 
making for detection. We do not, for instance, know whether it is the basic person- 
ality pathology itself or the extent and depth of the guilt feelings prior and sub- 
sequent to the sex offense, which operate toward apprehension by the police. For 
every 100 offenses recorded by the police only about 5 have led to prison sentences ©. 
Therefore the writer’s group may be highly selective, and the writer is not in a posi- 
tion to evaluate the process of selection accurately. He does, however, assume that 
the selective process operates to about an equal extent in the pedophile and the 
rapist-control groups. 

In a sense, the rapists may serve as a control group for the pedophiles since they 
were also detected, tried and incarcerated for a sexual offense, and had been living 
in a more or less common prison environment at the time of testing. Pascal and 
Herzberg have repeatedly found a lack of differentiation between rapists and a 
control group on various variables and conclude that ‘‘the controls and rapists 
are indistinguishable from each other’: »- *), In explanation they state: “If we 
take heterosexual behavior between adults as our standard, then it is difficult to see 
how the rapists differ from the controls in sexual behavior per se’’ ®- #7), 


PROCEDURE 


The H-T-P, because of its relatively unthreatening quality, is presented as the 
first projective technique in the psychological battery. On separate sheets of 8144 by 
7 inch paper“: 2) each subject was requested to draw a House, a Tree, a Person, and 


then a Person of the sex opposite to that drawn immediately prior; thus, four draw- 
ings including one of a male and one of a female Person were obtained. In regard to 
his drawn Tree, the subject was asked whether it depicted a live or a dead Tree. 


RESULTS 


Two of the 31 rapists, five of the 33 heterosexual pedophiles and six of the 20 
homosexual pedophiles drew “dead” Trees. Thus, the rapists, the homosexual 
pedophiles, and the heterosexual pedophiles fall on a continuum in regard to seeing 
their drawn Trees as “dead,” with 6.4%, 15.2% and 30.0% of each group, respect- 
ively, doing so. In three-way comparisons, only the difference between the rapists 
and the homosexual pedophiles (the two ends of the continuum) attains statistical 
significance at the 5% level of confidence (¢ = 2.17). 


DIscussIoNn 


From a logical viewpoint there appear to be two main complementary positions 
to take in interpreting the significant difference obtained between the incidence of 
“dead” Trees of rapists and homosexual pedophiles. One may accept the H-T-P 
hypothesis that the attribute of “‘deadness’’ ascribed to the drawn Tree tends to 
reflect serious psychopathology in the subject. If this is assumed, the data then in- 
dicate that there seems to be a progression from rapists to heterosexual pedophiles, 
to homosexual pedophiles, in regard to the incidence of significant psychopathology 
which parallels the increasing distance from an appropriate sex object. The data 
then suggest that an increasing distance from an appropriate sex object goes hand- 
in-hand with an increase of the likelihood of serious psychopathology. The homo- 
sexual pedophiles, who deviate from the norm in both age and sex of the partner 
chosen, are then to be viewed as the most emotionally crippled sub-group of the sex 
offenders studied. 
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An alternative interpretation is to assume the psychoanalytic viewpoint that 
the more one deviates in one’s choice of a sex object, the more psychopathology is 
indicated. If this psychoanalytic position is accepted, then the data tend to support 
Buck’s “: ?) hypothesis that it is people with the most serious psychopathology who 
are more likely to describe their drawn Trees as “dead.” On a practical level, how- 
ever, both of the above hypotheses would have to be substantially correct for the 
statistically significant data to have emerged. 


SUMMARY 


H-T-P’s were administered to 84 sex offenders at Sing Sing Prison, 31 of whom 
were incarcerated for an act of rape of an adult female, 33 for a sexual approach 
toward a female child, and 20 for a sexual approach toward a male child. After 
each subject completed his drawing of the Tree he was asked whether it depicted a 
live or a dead Tree. The incidence of ‘“‘dead’’ Trees of the three groups was then 
compared. The homosexual pedophiles had a significantly greater incidence of 
“dead” Trees than did the rapists. 

There seems to be a progression from rapists to heterosexual pedophiles to 
homosexual pedophiles in regard to the number who saw their drawn Trees as 
‘“‘dead”’ which parallels the increasing distance from an appropriate sex object. On 
the one hand, these parallel data tend to support Buck’s“’ ?) view that the psycho- 
logically sickest people see their drawn Trees as “dead”’; and on the other hand, 
tend to describe the homosexual pedophiles, who deviate from the norm in both age 
and sex of the partner chosen, as the sickest sub-group of sex offenders studied. 
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A FACTOR ANALYSIS OF SCHIZOPHRENIC RATINGS ON THE 
HOSPITAL ADJUSTMENT SCALE! 


WILSON H. GUERTIN 


Veterans Administration Hospital 
K ille, Iowa 


PROBLEM 


In recent years psychiatric rating scales have enjoyed some popularity as clinical 
instruments®). Their use as clinical tools is probably more familiar to the reader 
than their application in factor analytic research “: ?. 4, 5. 6, 8, 14, 15, 16,17) Investigators, 
employing rating scales, have been able to propose modifications in the realms of 
psychopathological theory “ as well as diagnosis. 

Rating scales sometimes strive toward the more directly observable, in revolt 
against highly abstract concepts and arbitrary classifications. The epitome of low- 
level statements about patients is probably the study of their simple, everyday 
responses. Observations at this level are employed by the Hospital Adjustment 
Seale (HAS), recently developed by Ferguson, McReynolds, and Ballachey “. The 
scale deals with such fundamentally important data about patients that it cannot 
help but attract interest. 

What are the fundamental dimensions underlying schizophrenic adjustment? 
A factor analysis of the HAS should help answer this question. The analysis might 
not only clarify what fundamental dimensions underlie schizophrenic adjustment 
but might also identify HAS items as measures of them. Such a factor analysis, 
translated into practical terms, should provide a more meaningful grouping of HAS 
items into appropriate subscales. 


PROCEDURE 


One hundred hospitalized schizophrenic males were employed in this study. All 
subjects were chronically ill, and no attempt was made to select them as representa- 
tive of the hospital. Rather, they were chosen on the basis of the availability of the 
HAS’s. The many items of the scale, 90 in all, made it necessary to restrict the 
analysis to a portion of them. Thirty-five items were selected from the total number 
so that they represented a sampling from each of the subsections of the scale and 
included all the items of the scale which seemed to the investigator to have unique- 
ness 


The scales were scored by psychiatric charge aides in the recommended fash- 
ion“), Then tetrachoric intercorrelations between the 35 items were obtained from 
four-fold contingency tables and charts. The resulting intercorrelation matrix was 
factored by the multiple-group centroid method “?». Communalities for those items 
within a cluster were estimated by employing the individual’s largest correlation 
within the cluster. Those outside the clusters had communality estimates based 
upon the highest intercorrelation in the whole column. 


RESULTS 


One very large group factor emerged with 28 of the 35 items loaded .40 or greater 
on this factor. Two other clusters were identified by inspection of the first residual 
matrix. Simultaneous extraction of all three clusters then led to the oblique factor 
matrix seen in Table 1. Decimal points have been omitted. All factors were very 
closely related: A with B was .67, A with C was .67, and B with C was .51. 


1From the Veterans Administration, Knoxville, lowa. Author now in Medical Service to Supreme 
Bench, Courthouse, Baltimore 2, Maryland. 
Dr. Theorore T. McKnelly’s contribution in collecting the original data is hereby acknowledged. 





A FACTOR ANALYSIS OF SCHIZOPHRENIC RATINGS ON THE HOSPITAL 


TaBLE 1. Osiigue Factor Matrix or HAS Items 








Factor 





Lack of 
General i Personal 
Abbreviated description i Interest Unconcern 


Interested in nothing 65 
Good job once started -78 
No reaction to entertainment 57 
Needs help to stick to things 68 
Willing to do any chore -51 
Would sit all day 63 
Reads papers and magazines 49 
Doesn’t need job supervision —~44 
Good shop worker -73 
No distinction among aides 55 
Always replies 

Wants to do right 
Follows events in paper 
No pride in appearance 
Will do any recreation 
Asks for work 

Starts conversation 
Does opposite 





No part in recreation 
Talks about sports 
Enjoys being talked to 
Plays cards 

Plays ball 

Doesn’t play ward games 
Talks about family 
Never volunteers information 
Becomes upset 

Yells at aide 

Talks to self 

Answers sensibly 


Needs supervision dressing 
Needs reminding of routine 
Stays neat and clean 

Needs push to follow routine 
Never asks for anything 








The original communality estimates accounted for 76 per cent of the total 
variance of the test items. The final factoring accounted for 83 per cent of this origin- 
ally estimated communality. Thus, apj roximately 63 per cent of the variance of 
the items sampled could be described in terms of the three oblique factors. 


DIscussiIon 


The extremely high communality among the items of the study argues well for 
the reliability of the test instrument. 1t is presumed that the simple and fundamental 
nature of the items is responsible for this reliability in scoring. The three obtained 
factors are as follows: 

Lack of General Interest. This is the largest of the three factors and is quite 
heavily loaded for most items. As such it emphasizes the motivational aspect of 
schizophrenia. Characteristic of this syndrome is the item from the HAS, “Interested 
in nothing’. This syndrome factor embodies slow, disinterested movement and 
general apathy. The syndrome does not appear to include actively defensive tech- 
niques or personal untidiness. Probably none of the conventional four subtypes of 
schizophrenia correspond well with this factor. The catatonic would be too actively 
defensive, while the simple would be not quite apathetic enough. The chronic un- 
differentiated schizophrenic probably comes closest to illustrating this syndrome. 
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Social Withdrawal. This syndrome factor suggests an actively defensive with- 
drawal from social contacts. Individuals with heavy loadings on this factor would 
deliberately avoid entering into activities which involve social intercourse. Paranoid 
individuals may be loaded on this syndrome factor because of the hostility and fear 


which cause them to avoid social contacts. Actively resistive catatonics probably 
also present this picture. 


Personal Unconcern. This syndrome factor is rather specifically related to the 
amount of attention given to personal needs. Characteristic is the item: ‘Needs re- 
minding to attend to routine”. Such subjects are untidy and often soil themselves. 
This factor syndrome seems to embody the essence of what is meant when a chronic 
ward is called a “deteriorated” ward. Hebephrenics and the deteriorated chronic 
undifferentiated schizophrenics best illustrate the syndrome. 


The HAS subsections were subjectively determined by the authors“. There- 
fore, it is quite surprising to find that the nine most heavily loaded items on the Lack 
of General Interest factor of this study are all from the HAS subsection “Work, 
Activities, and Recreation.’”’ Again, the five most heavily loaded items on the Per- 
sonal Unconcern factor of the study are all from the HAS subsection ‘‘Care of Self 
and Social Responsibility”. The remaining HAS subsection “Communication and 
Interpersonal Relations,” seems to contribute little since it does not correspond to 
any of the obtained factors. Perhaps additional syndrome factors would be forth- 
coming if the adjustment of others than schizophrenics were studied. If no other 
factor can be discovered which would correspond to the HAS subsection ‘‘Communi- 
cation and Interpersonal Relations,” the latter should be discontinued as a sub- 
section. A promising substitute would be a new subsection “Social interaction,” 
as identified by the Social Withdrawal factor of this study. 

The thesis by Blair“ on the social behavior of schizophrenics is the only other 
known factor analytic study that investigated the adjustment of schizophrenics. He 
found a highly related triad of oblique factors which corresponded structurally to 
the triad of factors encountered in this study. His first and largest factor, General 
Social Orientation and Capacity, is probably the General Lack of Interest factor of 
this study. His Socio-motoric Responses factor probably corresponds quite well to 
the Social Withdrawal factor of this study. Blair’s third factor of the triad was called 
Reactive Assertion and Reproduction and is probably not sampled by the HAS. 
Conversely, the third factor of this study is not sampled by Blair’s items. Nor do 
Blair’s other three oblique factors seem to be sampled by the HAS items. His addi- 
tional factors were Restitutional Curiosity, Nonchalance and Lack of Preoccupa- 
tion, and Interpersonal Confronting. 

In conclusion, it is felt that the HAS could benefit by revision. At least one 
subscale needs to be added, and some additional items which sample some of the 
areas encountered by Blair would seem desirable. However, the general approach 
of the HAS seems to show promise. The items seem quite reliable (high communali- 
ties) and they concentrate on areas not too well sampled by other psychiatric evalua- 
tions. 

SUMMARY AND CONCLUSIONS 


1. Rather significant research directed toward theory construction has been 
forthcoming recently through the use of rating scales. The Hospital Adjustment 
Scale is factor analyzed here because of both theoretical and practical interests. 


2. One hundred chronic, hospitalized schizophrenics were rated on the HAS 


by Serer aides. Thirty-five of the items were carefully selected for further 
study. 


3. The factor analysis revealed three oblique syndrome factors: Lack of Gen- 


eral Interest, Social Withdrawal, and Personal Unconcern. All the factors were 
closely related to one another. 


4. Suggestions for broadening the usefulness of the scale are made. It is con- 
cluded that the general approach shows promise. 
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A TRANSPOSED ANALYSIS OF THE BENDER-GESTALTS 
OF PARANOID SCHIZOPHRENICS 


WILSON H. GUERTIN 


Veterans Administration Hosyital* 
Knoxville, Iowa 


A study by Guertin and Zilaitis® employed Minnesota Multiphasic Personality 
Inventory items to disclose four types of paranoid schizophrenics. If the four types 
of paranoid schizophrenics are stable enough they will be encountered repeatedly. 
After cross-validations, sound proposals could be made for modifying the current 
classification of ‘paranoid schizophrenia.”” The present study substitutes perform- 
ance on the Bender-Gestalt for the MMPI in order to generalize the conditions of 
the search for the type-factors. Also, veterans are employed, whereas the previous 
study used state hospital patients. 


PROCEDURE 


Thirty hospitalized male schizophrenics were the subjects of this investigation. 
All were recently diagnosed paranoid schizophrenics. Particular care was taken not 
to include subjects who were in complete remission at the time of the examination. 
No data can be given on the number of uncooperative patients that had to be ex- 
cluded from the study because the patients were selected on the basis of the avail- 
ability of their Benders in the files. However, it is our opinion that there would be 
less sample bias operating than in the previous study “) which was biased by exclud- 
ing a few uncooperative paranoids. More serious were exclusions of subjects with ap- 
preciable confusion. The present data are not as biased since Bender performances 
have much meaning and can be retained even though they reflect confusion. 


1Now in Medical Service to Supreme Bench of Baltimore, Courthouse, Baltimore 2, Maryland. 
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The tests were administered in standard manner). Each Bender was scored 
with respect to 100 different items. Distortions on the various figures were evaluated 
by a method which follows conventional scoring procedures, constitutes a rather 
exhaustive analysis of each protocol and is broadly described elsewhere ®: *: 4. ©), 

Tetrachoric intercorrelations between the individuals were obtained from four- 
fold contingency tables and charts. The resulting intercorrelation matrix was factor- 
ed by the multiple-group centroid method“. Communalities for those individuals 
within a cluster were first estimated by employing the individual’s largest correla- 
tion within the cluster. Those outside the clusters had communality estimates based 
upon the highest intercorrelation in the whole column. Plots of the final factor space 
showed that the clusters were best identified by those variables first employed. 
Psychological reports describing behavior and test results other than the Bender- 
Gestalt were available in the files. These reports provided the symptomatic behavior 
data contingent with each of the types of Bender performance. 


REsULTs 


Unlike the earlier study“, the intercorrelation matrix failed to reveal a single 
large group-factor. However, four fairly independent clusters were easily recognized 
through conventional cluster analysis. Simultaneous multiple-group centroid ex- 
traction of these four factors produced the oblique factor matrix reported in Table 1. 
Decimal points have been omitted. 


Tasip 1. Ostique Factor Loapines ror Types oF BENDER PERFORMANCE OF 
PARANOIDS 








Type Factor 





Chronic Inadequate 


Hostile Poorly 
Deteriorated (A) Reactive (B) Integrated (C) Withdrawn (D) 





85 15 
77 -O1 
75 36 
60 15 
35 13 


47 
27 
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Original communality estimates accounted for approximately 56 per cent of the 
total variance of the individuals’ performances on the Bender-Gestalt, and the de- 
rived factors accounted for approximately 82 per cent of this originally estimated 
communality. The correlations of the oblique factors are as follows: .5 for A with B, 
.5 for A with C, .2 for A with D, .3 for B with C, .3 for B with D and .1 for C with D. 

The fairly high communality among subjects of this study points to the common 
features of test performance of these paranoid schizophrenics. The resolution into 
four separate factors demonstrates that these common features are not all shared 
with one another but depend upon four different types of paranoid schizophrenic 
Bender performance. 


THe CHRONIC AND DETERIORATED PARANOID (TYPE A) 


Characteristic of the Bender productions of individuals of this type are: (1) 
Many minor distortions despite evidence that they are doing the best they can, 
(2) their obvious failures on the more difficult portions of the figures, (3) a tendency 
to use slightly too much paper, and (4) the fluidity in the drawing of curved lines. 
Rather direct interpretations of these Bender features, which are supported by the 
symptom pictures of the individuals are: (a) Some deterioration of former abilities, 
often noted in older patients, (b) the accompanying poor visual-motor skill, (c) a 
poor recognition of their shortcomings, (d) the over-use of denial as a defense with 
avoidance of self-reflection, and (e) inroads upon emotional controls. 


Tue HostiLte ReEActTIvE PARANOID (Type B) 


Characteristic of this type of paranoid are: (1) No particular care taken in 
drawing, (2) crowding and poor placement on the page, and (3) fluidity in the draw- 
ing of curved lines. Direct interpretations from these Bender characteristics, which 
are supported also by the behavior of the individuals are: (a) Compensatory feeling 


of adequacy, (b) denial and evasion used as defenses, (c) considerable resistiveness 
and hostility, (d) carelessness, (e) impulsiveness, and (f) poor planning. 


Tue Poorty INTEGRATED PaRANorpD (Type C) 


Subjects of this type showed Bender-Gestalt performances characterized by: 
(1) Poor utilization of space and excessive paper, and (2), a stretching of some 
figures in a single dimension. Direct interpretations of the Bender-Gestalt, which 
are supported by the patient’s symptoms, are (a) A rather acute dissolution of con- 
trols which results in expansiveness, (b) weak general personality integration, (c) 
poor planning, (d) emotional thinking with blind spots and delusions, (e) unreason- 
ableness, and (f) a generalized hostility toward the world. 


THE INADEQUATE AND WITHDRAWN ParRANorp (Type D) 


These individuals show the following Bender characteristics: (1) Irregular order, 
(2) small figures, (3) crowding onto one page or less, and (4) some gross distortions 
appearing along with very well-drawn figures. Direct interpretations of these Bender 
characteristics, supported by the behavior are: (a) Fluctuating reality contact which 
may be described as clouding of consciousness, (b) temporary losses of control, (c) 
perplexity, and (d) general withdrawal. 


Discussion 


Three of the above factors seem to correspond to three of the four factors in the 
previous study. The present Hostile Reactive Paranoid type probably corresponds 
to the Evasive and Well-Integrated Paranoid type of the previous study. The Poorly 
Integrated type probably corresponds, in part, to the Grandiose and Delusional 
Paranoid type of the previous study. The Inadequate and Withdrawn type seems to 
correspond to the Sensitive, Inadequate, and Withdrawn type of the previous study. 
The one factor of this study not previously encountered, the Chronic and Deterior- 
ated Paranoid type, probably was excluded from the previous study by the relia- 
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bility screening test. The deterioration of abilities would probably have prevented 
the subjects from sorting the MMPI items reliably enough to be retained in that 
study. The Socially Normal Paranoid type, previously encountered but missing in 
this study is probably attributable to the smaller variance of patients’ scores on the 
MMPI. A much wider range of variation probably is encountered for the scoring 
items of the Bender-Gestalt than for the MMPI items. In other words, the common 
species factor, appearing so strongly in the first study, almost vanishes here because 
of the increased variability of performance under the different test conditions. 
While the present study is quite suggestive, it is felt that it is too soon to propose a 
classificatory change. It would be desirable to conduct further research with trans- 
posed factor analyses of paranoid schizophrenics on mental status items. 


SUMMARY AND CONCLUSIONS 


1. A previous study disclosed four type-factors for paranoid schizophrenics. 
The current study is a cross-validation of those findings under generalized conditions. 
Performance items of the Bender-Gestalt are substituted for MMPI items. 


2. Thirty hospitalized male paranoid schizophrenics provided the Benders 
which were scored for 100 items. Tetrachoric correlations between individuals were 
calculated. The resulting intercorrelation matrix was factored by the multiple-group 
method. 


3. Communality was quite high. The following four factors were obtained: 
A. Chronic and deteriorated paranoid, B. Hostile reactive paranoid, C. Poorly in- 
tegrated paranoid, and D. Inadequate and withdrawn paranoid. 


4. Good correspondence is found between these factors and those previously 
obtained. Suggestions for further work are made. 
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THE EFFECT OF SOCIAL CLASS ON THE LENGTH AND TYPE OF 
PSYCHOTHERAPY IN A VETERANS ADMINISTRATION MENTAL 
HYGIENE CLINIC! 


ALVIN E. WINDER AND MARVIN HERSKO 


Veterans Administration Mental Hygiene Clinic 
Miami, Florida 


PROBLEM 


In recent years, American social scientists have become increasingly sensitive 
to social class differences. Warner!) and Hollingshead “ have developed and vali- 
dated social status scales and many others®: >. 7. 1°. ™) have applied these measures 
to varieties of individual, group and institutional behavior. This area is presently 
being explored by the most recent discipline of social psychiatry. Most recently, a 
large scale research on social structure and psychiatric treatment was carried out 
by the departments of psychiatry and sociology at Yale University. Robinson, 
Redlich and Myers“? report that even when economic factors are controlled by 
dealing separately with treatment agencies, psychotherapeutic methods are applied 
in disproportionately high degree to upper social levels. Auld and Myers, in an 
analysis of the relationship of length of treatment to social class in 65 clinic patients 
obtained from the Yale study, find that for this clinic population middle class 
patients stay in psychotherapy significantly longer than lower class patients. Clinic 
patients in the Yale study were obtained from an out-patient psychiatric clinic. 

Robinson, Redlich and Myers? state in their discussion, “‘In any event, little 
psychotherapy is given to lower class patients. (It is clearly understood that psy- 
chotherapy can be, and is applied to some lower class patients in the Mental Hygiene 
Clinics of the Veterans Administration.)’”? Auld and Myers remark, “A word of 
caution is in order, however, we are not sure that the relationship between social 
position and length of treatment would hold in other clinics. This must be deter- 
mined by further studies.’ The present study was designed in order to obtain furth- 
er data in this important area. 


PROCEDURE 


One hundred patients were chosen at random from a total population of 1250 
veterans who had received or were receiving psychotherapy at a Veterans Adminis- 
tration Mental Hygiene Out-Patient Clinic. Individuals who entered treatment less 
than six months before the study was undertaken were excluded from the sample. 
Patients included in the sample average 38.1 years in age while the range is from 
27 to 60. There were no female patients in the sample, a situation due to chance but 
one which reflects the heavy preponderance of male veterans. Of the hundred pa- 
tients 68 were diagnosed as neurotic, 17 were diagnosed as psychotic and 15 were 
diagnosed as suffering from an organic condition but given psychotherapy on an 
adjunctive basis. 

In accordance with previous research in this area, Hollingshead’s Index of Social 
Position was used to determine the patients’ social class. However, because of the 
nature of the population, many of whom have recently moved to the geographical 
area served by the Clinic, place of residence was not used as a measure of social class. 
The Index scores permitted the assignment of each patient to one of five social levels. 
Class I comprises families of wealth, education, and top social prestige. Class II 
consists of families in which the adults, for the most part, hold college degrees and in 
which the fathers have professional or high-level managerial occupations. Class III 
includes proprietors, white-collar workers, and skilled workers; they are mostly 


ag | ~ Veterans Administration Regional Office, Miami, Florida. 
p. ‘ 
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high-school graduates. Class IV consists largely of semi-skilled workers and laborers, 
with less than high-school education. Class V includes unskilled and semi-skilled 
workers, who have a grade-school education or less. 

The treatment received by the 100 patients in our sample was analyzed with 
respect to two variables, type of psychotherapy and duration of therapy. It was 
found that the therapeutic approach could be placed in one of two categories, An- 
alytic psychotherapy or Relationship therapy. The former is taken to mean therapy 
short of full analysis, practiced by a trained psychiatrist or psychologist and based 
upon psychoanalytic principles. Relationship therapy includes all treatment in 
which the approach was predominantly supportive, directive, suggestive or some 
combination thereof. Duration of treatment was broken down, more or less arbi- 
trarily, into three categories, 1 to 9 sessions, 10 to 19 sessions and over 20 sessions. 
X? was used to evaluate the significance of the relationships between type of psy- 
chotherapy, duration of psychotherapy and social class. 


RESULTS 


None of the patients in our sample fell into Class I. To facilitate the statistical 
analysis, Classes II and III were combined and considered middle class. Similarly, 
Classes IV and V were combined and considered lower class. Table 1 presents the 
data on the relationship between length of psychotherapy and social class of the 
patient. The findings indicate that patients in Classes II and III tend to receive 
more psychotherapy sessions than patients in Classes IV and V. X? reveals, that 
this relationship is significant at the 5% level. 


TaBLE 1. Tue Revation or LENGTH AND Type oF PsYCHOTHERAPY AND 
SocraL Crass 








Factors Number of Patients 





Classes IT and III Classes ITV and V 





Number of Interviews 
1to9 13 

10 to 19 20 

20 and over 15 
Total 48 





d.f.=2 P<.05 





Type of eeenney 
Analytic 28 10 
Relationship 24 38 











X? = 11.3 df. = 1 P<.01 





Table 1 also presents the findings with respect to the relationship between type 
of psychotherapy and the social class of the patient. The findings indicate that 
patients in Classes II and III tend to receive more Analytic Psychotherapy and less 
Relationship therapy than is the case with patients in classes IV and V. This 
difference is significant at better than the 1% level. 


Discussion 


The authors feel that the patient’s perception both of the clinic and of the 
treatment situation are important in understanding the data in Table 1. Social 
scientists: 4. 9) who have studied the relationship between social class and social 
institutions find that the lower class attitude towards social institutions is character- 
ized by rebellion and a feeling that these institutions do not understand or try to 
help them. It is likely then that the Mental Hygiene Clinic is viewed by lower class 
patients as a hostile, middle-class institution. Their attitude toward the treatment 
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situation then is likely to display a counter hostility. This may take the form of 
either aggressive or overly demanding behavior. 

On the other hand, middle class patients are said to be characterized as coming 
from a culture which instills conformance and excessive repressive tendencies. Mem- 
bers of this class have been described as orderly, conscientious, responsible and tame. 
These qualities have been found to be most acceptable and most rewarded by mid- 
dle class institutions. Therefore these individuals in their clinic contacts may tend 
to fulfill their own needs and those of the clinic to a greater extent than do lower 
class patients and so remain longer in treatment. 

Another aspect of this problem is the relationship between social class and type 
of psychotherapeutic approach. Recent research“: *) has pointed out that thera- 
pists are by and large members of the middle class and may therefore feel more at 
ease with and possibly give preferential treatment to patients whose social and 
cultural background is similar to their own. This may serve as a partial explanation 
of the present findings. Some specific factors that may predispose the middle class 
therapist to prefer to give analytical psychotherapy to middle class patients are their 
greater interest in self understanding and their readier acceptance of the psycho- 
therapeutic relationship. 

The findings in this study support those of the researches cited above“: ® in 
that middle class patients receive significantly more analytically oriented psycho- 
therapy and stay in treatment for a longer period of time. However, they depart 
from previous findings in that approximately half of the clinic population receiving 
therapy is from the lower class. This suggests that lower class patients can be treated 
psychotherapeutically. More research in this area should lead to better understand- 
ing of the needs of lower class patients and to the development of psychotherapeutic 
techniques to better meet these needs. 


SUMMARY 


A 100 patient sample from a Veterans Administration Mental Hygiene Clinic 
was analyzed with respect to social class, duration of psychotherapy, and psycho- 
therapeutic approach. It was found that: 


1. Approximately half of the patients were members of the lower class. 


2. Middle class patients tend to remain in treatment for a significantly greater 
number of psychotherapy sessions than do lower class patients. 


3. There is a significant tendency for more middle class patients than lower 
class patients to receive analytically oriented psychotherapy. 
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THE RELATION OF RORSCHACH INDICES OF EXTRAVERSION- 
INTROVERSION TO CERTAIN DREAM DIMENSIONS 


LESTER MANN 
Mental Hygiene Clinic of Raleigh and Wake County, North Carolina 


PROBLEM 


Rorschach early intimated relationships between his test’s indices of extra- 
version-introversion and dreams“). However he did not elaborate them and sub- 
sequent investigators have neglected the problem of which the present study is a pre- 
liminary inquiry. An a-posteriori approach was adopted in keeping with the ex- 
ploratory nature of the study. Single dreams were obtained from subjects who were 
administered the Rorschach test. They were analyzed to determine those dream 
dimensions permitting abstraction. The subjects were then compared in regard 
to their position on these dimensions and their extraversion-introversion ratings on 
the Rorschach test. 

The technique of dream elicitation employed was one suggested by Balken“?; 
subjects are simply requested to tell any dream they can remember. It is communal 
with the projective techniques in the unstructured nature of the task it affords. 

The sample of dreams thus obtained could be ordered according to the following 
dimensions: 

1. Completeness. The extent to which the dream approaches a complete narrative, i.e., 
proceeds from a beginning through a central exposition and reaches a conclusion. 

2. Detailedness. The degree of detail elaborated in the dream. 

3. Pleasantness-unpleasantness. The extent of pleasant or unpleasant content in the dream. 


4. Recency. The temporal placement of the dream, e.g., whether from childhood or the night 
prior to the investigation. 


5. Recurrency-non recurrency. Whether the dream is recurrent or not; if recurrent, the fre- 
quency of the recurrency. 


An attempt was made to delineate other dimensions as well, e.g., wish fulfill- 
ment, anxiety, aggression. These, however, were either not present in sufficient degree 
or were too ambiguous, in the sample studied, to permit abstraction. The dimensions 
presented above were clearly observable and mutually exclusive. 

The following Rorschach indices of extraversion-introversion were studied in 
relation to the above dream dimensions: M, C total, C sum, M:C total, M:C sum, 
FM+m, Fe+c+C’, FM+m:Fc+c+C’. The first five of these are the original 
E-I indices evolved by Rorschach. The latter three were derived by Klopfer®?. 


METHOD 


Forty six undergraduate students served as subjects. The Rorschach test and 
dream technique were administered individually, in that order. The Rorschach was 
administered according to Klopfer’s directions“. To obtain the dreams the sub- 
jects were asked to tell ‘‘any dream you can remember’. 

Rorschach protocols were scored for M, C total, and C sum according to the 
rules prescribed by Beck®’. FM, m, Fc, c, and C’ were scored in accordance with 
Klopfer“. A high degree of reliability in scoring was found in comparing the ex- 
aminer’s judgements with those of three independent judges. 

Chi square was used to test the relationship between Rorschach E-I indices and 
the various dream dimensions. The subjects were ranked according to their position 
on the dream dimensions and Rorschach indices. The distributions thus obtained 
were dichotomized to permit the Chi square tests. Corrections for continuity were 
made where necessary. 

Response totals significantly affected certain of the Rorschach indices, i.e., 
M, C total, FM+m, and Fce+c+C’, in the entire distribution of subjects. This factor 
was successfully controlled by studying such indices in sub groups formed on the 
basis of R, as recommended by Cronbach ®?. 
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RESULTS AND DIscuUssION 


A total of forty tests of significance were made in regard to the relation of 
Rorschach E-I indices to the dream dimensions. No test was significant at or beyond 
the 5% level. Only one, that for the negative relation of M to completeness, was 
beyond the 10% level. 

The present study sustains the null hypothesis in regard to the relation of 
Rorschach E-I indices to the dream dimensions studied. It is possible, of course, 
that the dream dimensions permitting of study in this investigation were not 
pertinent ones as regards the Rorschach E-I indices. A more probable consideration 
is that isolated dreams do not provide a sufficiently reliable and revelatory sample 
of an individual’s ‘dream style”. Future research may be more profitable if directed 
towards dream series, to determine primary characteristics and thema. 


SUMMARY 


A study was made of the relationships between Rorschach E-I indices and 
certain dream dimensions. Forty six subjects were administered the Rorschach test 
and asked to relate ‘any dream you can remember”. The following dimensions 
could be abstracted from these dreams: Completeness, detailedness, pleasant-un- 
pleasant, recency, recurrency-non recurrency. These were studied in relation to: 
M, C total, C sum, M:C total, M:C sum, FM+m, Fe+c+C’, FM+m: FC+c+C’. 
Chi square was used to test significance. None of the relationships were significant 
at or beyond the 5% level. 
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PREDICTION OF INTELLIGENCE FROM CERTAIN RORSCHACH 
FACTORS 


ELIAS N. ABRAMS 
Veterans Administration Regional Office, Brooklyn, New York 


PROBLEM 


Estimating intelligence from an integration of Rorschach factors has been at- 
tempted by many clinical psychologists. These estimates are usually expressed in 
such terms as “feeble-minded,” “dull normal,” “low-average,”’ “high average,” 
“superior,” and ‘‘very superior.’’®) These general estimates may be satisfying to 
the clinician or therapist but do not gainsay the fact that traditionally the psycho- 
logist is expected to express levels of intelligence as scores or at least as ranges. The 
present study has a two fold objective: (1) to determine the degree of relationship 
between certain Rorschach factors in combination and a global estimate of intelli- 
gence as indicated by the Wechsler-Bellevue Full Scale 1Q, and (2) should this re- 
lationship prove to be significant to construct a formula to derive estimates of the 
Wechsler-Bellevue Full Scale IQ from the combined Rorschach factors. 

Rorschach stated that a number of elements in his test are related to intell- 
igence. The components influenced by this faculty are: 
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(1) Emotional stability or capacity for inner creativity as indicated by the number of 
acceptable human movement responses (M’s). 


(2) Critical judgment, form quality or perceptual clarity of configurations and of associative 
processes as disclosed by the Rorschach F + per cent. 


(3) Capacity for organization and integration in producing complicated performances as 
revealed by the number of acceptable whole responses (W’s). 


(4) Productivity, freedom from inhibition or the wealth of association and flexibility as 
expressed in the total number of responses (R). 


Rorschach further states that there is a gradation in the development of these 


capacities to the end that ‘‘there may be an incalculable number of variants, partial 
and total, of intelligence.” 


METHOD 


The subjects of this investigation were 400 clinic patients, veterans of World 
War II, examined at the Mental Hygiene Clinic, Veterans Administration Regional 
Office, Newark, New Jersey. Each of these subjects was administered individually 
the eleven sub-tests of the Wechsler-Bellevue and the Rorschach. These veterans 
had a mean chronological age of 28.7 years and their average level of educational 
achievement was the 10th grade. No attempt was made to control the psychiatric 
classification of these subjects. They varied from the mildest of psychoneurotic 
disturbances to the most severe of intracranial organic pathologies that might be 
encountered in an outpatient setting. The Rorschach test was administered with 
protocols scored and tabulated by Klopfer’s“? method. The Wechsler-Bellevue re- 
sults were obtained with standard instructions. 

After calculation of means and standard deviations, Pearson product-moment 
coefficients of correlation were computed for each of the four Rorschach variables 
and the Full Scale IQ. It then became necessary to evolve a weighting technique 
whereby the Rorschach variables could be equitably combined in order to yield a 
Rorschach score for each of the subjects. A product-moment correlation coefficient 
between the Rorschach score and the Full Scale IQ was calculated and became the 
basis for the regression equation from which predictions could be made from the com- 
posite of the four Rorschach variables, the Rorschach score, to the Full Scale1Q. A 


formula was then derived in terms of weights in order to predict the Full Scale IQ 
from the Rorschach raw scores. “: 2 5) 


RESULTS 


The intercorrelations between the variables which provided the basis for the 
formula were as follows: 


Full Seale IQ — and Rorschach F+% 354 
Full Scale 1Q and M 360 
Full Scale 1Q and W 358 
Full Scale 1Q and R 329 
Rorschach F+% and M 214 
Rorschach F+% and W .147 
Rorschach F+% and R .090 
M and W 396 
M and R 439 
WwW and R .296 


These correlation coefficients for the individual Rorschach elements are quite low. 
When weighted, combined and correlated with Full Scale 1Q the degree of relation- 
ship for the four elements rose to .525 + .024 (PE). (By means of multiple regression 
a similar result was obtained, i.e., .529 + .024) 

The formula evolved for the ready transmutation of the Rorschach raw scores 


into Full Scale 1Q by simple fundamental operations is expressed in equation form 
as follows: 
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2a + 17b + 9c + 2d 
x = 





+ 76 
10 


where ais the Rorschach F+% 
b is the number of acceptable M’s 
c is the number of acceptable W’s 
d is the total number of responses (R) 
X is the estimated measure of intelligence equivalent to the 
Wechsler-Bellevue Full Scale IQ 


Transformations produced by the use of the predictive formula show that the de- 
rived Rorschach ‘IQ’ has a mean of 103.68 as compared with obtained mean of 
103.74 for the Wechsler-Bellevue Full Scale IQ. ‘The standard deviation of the Full 
Scale IQ is 16.25, whereas that of the Rorschach is 8.42. Evidently the former is 
almost twice as variable as the latter. 


SUMMARY AND CONCLUSIONS 


A group of 400 cases of various diagnostic classifications was examined with the 
Rorschach and the Wechsler-Bellevue tests at the Mental Hygiene Clinic, Veterans 
Administration Regional Office, Newark, New Jersey. The objectives were (1) to 
determine the relationships between certain Rorschach elements (F+%, M, W, 
R) and the Wechsler-Bellevue Full Scale 1Q, and (2) to estimate the Full Scale IQ 
from a composite of these elements. 

A formuls. was derived for calculating a Full Scale IQ from the given Rorschach 
elements. The indication derived from the size of the Pearson r (.525) is that pre- 
dictive estimates by means of the formula could be made with a coefficient of de- 
pendability of fifteen per cent or that quite accurate predictions could be made in 
two cases out of three. This raises the further hypothesis that, were a greater num- 
ber of elements introduced, the degree of relationship might be increased and pre- 
dictions made with improved accuracy. 

Another qualification is that, since the size of the constant to be added is so 
large, the formula for prediction may not be used with individuals who are suspected 
of being mentally deficient (supposedly weeded out before induction into service or 
during basic training). It must not be overlooked that the prediction formula was 
derived from data obtained on clinic patients and, therefore, its applicability must 
be restricted to estimates made on similar populations. 
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THE RELATIONSHIP BETWEEN RAVEN’S COLORED PROGRESSIVE 
MATRICES AND TWO TESTS OF GENERAL INTELLIGENCE 


CHALMERS L. STACEY AND FREDERICK O. CARLETON 


Syracuse University 


PROBLEM 


Martin and Wiechers, in a recent study“, reported on the correlations they 
found between the performance of a group of children on Raven’s Colored Progress- 
ive Matrices and WISC. The subjects were one hundred school children between 
their ninth and tenth birthdays. The correlations between the Matrices test and the 
several WISC IQ’s were as follows: Matrices and WISC Full IQ, .91; Matrices and 
Verbal IQ, .84; and Matrices and Performance 1Q, .83. The authors report also the 
intercorrelations among the Matrices and the various subtests of the WISC. These 
ranged from .74 (Block Design) to .47 (Information). 

The purpose of the present study was to investigate the extent to which WISC 
and Stanford-Binet scores compared with. performance on Raven’s Colored Pro- 
gressive Matrices for subjects of a restricted range of intelligence. 


SUBJECTS AND PROCEDURE 


One hundred and fifty children referred to the Syracuse State School as possible 
mental defectives were administered the Colored Progressive Matrices, the Revised 
Stanford-Binet, Form L, and the WISC. The sample consisted of 80 girls and 70 
boys. All three tests were administered individually; for one half of the group the 
Stanford-Binet was administered first and for the other half the WISC was given 
first; for all subjects the Matrices was administered last. All three tests were given in 
one period. 


Table 1 presents descriptive statistics concerning chronological age and general 


intelligence of the 150 subjects. From this table it can be seen that the subjects 


Tasie 1. Rance, MEAN, STANDARD DEVIATION AND PEARSON Propuct MoMENT CoRRELATION WITH 
RaveEn’s PROGRESSIVE MatTRICES FOR CHRONOLOGICAL AGE AND VARIOUS MEASURES OF GENERAL 
INTELLIGENCE ON 150 Sussects (S.E., .08). 








Measures Range Mean Pearson r 





Raven’s Progressive Matrices 8-29 
Chronological Age (years) 7.42-15.75 
Stanford-Binet (Form L) MA (years) 5.17-12.67 
Stanford-Binet (Form L) IQ 39-89 
WISC Verbal IQ 42-91 
WISC Performance IQ 46-115 
WISC Full Scale IQ 36-91 
WISC Verbal (Weighted Score) 3-51 
WISC Performance (Wt. Score) 13-68 
WISC Full Scale (Wt. Score) 16-106 


WISC Subtests (Weighted Scores) 
Picture Completion 0-16 
Digit Span 0-12 
Arithmetic 1-9 
Block Design 0-13 
Information 
Similarities 0-11 
Picture Arrangement 0-14 
Vocabulary 0-10 
Object Arrangement 0-16 
Completion 0-12 
Coding 0-15 
Mazes 
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ranged in chronological age from 7 years 5 months to 15 years 9 months, with a mean 
of 12 years 6 months. Scores on the Colored Matrices ranged from 8 to 29 with a 
mean of 18.73 and a standard deviation of 4.69. The range in Stanford-Binet MA’s 
was from 5 years 2 months to 12 years 8 months with a mean of 8 years 4 months. 
The range in Stanford-Binet IQ’s was from 39 to 89, with a mean of 67.02. On the 
WISC Full Seale IQ scores ranged from 36 to 91 with a mean of 68.35. 


RESULTS 


The correlations between the Matrices scores and Stanford-Binet MA’s and 
1Q’s, WISC 1Q’s and weighted scores, and WISC subtests weighted scores are pre- 
sented in Table 1. Correlations of .69 and .71 were obtained between Matrices scores 
and Stanford-Binet MA’s and IQ’s. Correlations of .54, .52, and .55 were obtained 
between Matrices scores and WISC Verbal, Performance, and Full Scale 1Q’s. The 
correlations between Matrices scores and WISC weighted Verbal, Performance, and 
Full Scale scores were .55, .51, and .62. It is interesting that the correlations between 
Matrices scores and Stanford-Binet MA’s and I1Q’s were quite similar and at the 
same time higher than those between the Matrices scores and WISC IQ’s and weight- 
ed scores. The correlations between Matrices scores and the 12 subtest weighted 
scores ranged from .48 (Picture Completion) to .28 (Coding and Mazes). 


SUMMARY 


Raven’s Colored Progressive Matrices, the Revised Stanford-Binet (Form L), 
and the WISC were administered to 150 children between the ages of 7 years 5 
months and 15 years 9 months. All these children had been referred to the Syracuse 
State School as possible mental defectives. Not all of the subjects, however, were 
retained in the institution. Pearson product-moment correlations were obtained 
between Matrices scores and Stanford-Binet MA’s and IQ’s, WISC IQ’s and weight- 


ed scores, and subtests weighted scores. Correlations obtained between Matrices 
scores and the Stanford-Binet MA’s and IQ’s were similar but higher than the cor- 
relations between the Matrices scores and WISC 1Q’s and weighted scores. Among 
the correlations between Matrices scores and WISC IQ’s and weighted scores, the 
highest found was that for the Full weighted scores. The correlations between 
Matrices scores and the WISC weighted subtests scores ranged from .48 (P. C.) to 
.28 (Coding and Mazes). 
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THE RELATIONSHIP BETWEEN RAVEN’S COLORED PROGRESSIVE 
MATRICES AND TWO TESTS OF GENERAL INTELLIGENCE 
FOR 172 SUBNORMAL ADULT SUBJECTS 


CHALMERS L. STACEY AND MARIE R. GILL 


Syracuse University 


PROBLEM 


Two recent studies have reported on the correlations found between Raven’s 
Colored Progressive Matrices and other tests of general intelligence. Martin and 
Wiechers“? dealt with the WISC and Stacey and Carleton“? used the WISC and 
the revised Stanford-Binet. The purpose of this study was to investigate the extent 
to which the Wechsler-Bellevue and Stanford-Binet scores compared with perform- 


ance on Raven’s Colored Profressive Matrices for a group of subnormal adult sub- 
jects. 


SUBJECTS AND PROCEDURE 


One hundred and seventy-two subnormal adult subjects under the jurisdiction 
of the Syracuse State School were administered the Colored Progressive Matrices, 
the revised Stanford-Binet, Form L, and the W-B. The sample consisted of 105 
females and 67 males. One hundred and thirty-five of the subjects were not living 
within the institution at the time of testing. They were on what is known as ‘‘Con- 
valescent Care’ working and living in the community. The females were employed 
in domestic work and lived with the families who employed them. The males were 
employed in various occupations, such as dishwashers in restaurants, elevator opera- 
tors, etc., and lived in rooming houses obtained for them through the Social Service 
department of the institution. 

All three tests were administered individually; for one half of the group the 


Stanford-Binet was administered first and for the other half the W-B was given 
first; for all subjects the Matrices was administered last. All three tests were given 
in one period. 


TaBie 1. Ranos, Mgan, STANDARD DEVIATION AND PEARSON Propvct Moment CoRRELATION 
wit Raven’s CoLorEeD PRoGREsSIVE MATRICES FOR VARIOUS MEASURES OF GENERAL 
INTELLIGENCE ON 172 SuBNoRMAL ApuLT Sussects (8. E.r + .07) 








Range Mean 


Raven’s Colored Prog. Matrices 11-34 20.8 
Chronological Age (years) 16 .0-57.5 
Rev. S-Binet (Form L) I.Q. 40-84 
W-B Verbal I.Q. 53-93 
W-B Performance I.Q. 51-121 
W-B Full Scale 1.Q. 46-102 
W-B Verbal (Weighted Score) 11441 
W-B Performance (Wt. Score) 6-66 
W-B Full Scale (Wt. Score) 28-107 


W-B Subtests (Weighted Scores) 
Block Design 1-14 
Picture Arrangement 0-14 
Picture Completion 0-13 
Similarities 1-8 

Vocabulary 1-10 
Comprehension 1-11 
Information 1-10 
Digit-Symbol : 
Object-Assembly 0-15 
Digit-Span 0-11 
Arithmetic 0-9 
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Table 1 presents descriptive statistics concerning chronological age and general 
intelligence of the 172 subjects. The subjects ranged in chronological age from 16 
years to 57 years 6 months, with a mean of 23 years 10 months. One hundred and 
sixty-seven of the subjects were in the age range between 16 and 39 years. Scores 
on the Colored Matrices ranged from 11 to 34 with a mean of 20.8. The range in 
Stanford-Binet 1Q’s was from 40 to 84 with a mean of 63.8. On the W-B Full Scale 
1Q scores ranged from 46 to 102 with a mean of 69.7. 


RESULTS 


The correlations between the Matrices scores and Stanford-Binet 1Q’s, W-B 
IQ’s and weighted scores are presented in Table 1. A correlation of .86 was obtained 
between Matrices scores and Stanford-Binet IQ’s. Correlations of .56, .51, and .68 
were obtained between Matrices scores and W-B Verbal, Performance, and Full Scale 
1Q’s. The correlations between Matrices scores and W-B weighted Verbal, Perform- 
ance, and Full Scale scores were .51, .49, and .64. The correlations between Matrices 
scores and the 11 subtests weighted scores ranged from .60 (Block Design) to .29 
(Arithmetic). 


SUMMARY 


Raven’s Colored Progressive Matrices, the Rev. Stanford-Binet, Form L, and 
the Wechsler-Bellevue were administered to 172 subnormal adult subjects between 
the ages of 16 years and 57 years 6 months. All subjects were under the jurisdiction 
of the Syracuse State School. Pearson product-moment correlations were obtained 
between Matrices scores and Stanford-Binet 1Q’s, W-B I1Q’s and weighted scores 
and subtest weighted scores. The correlation obtained between Matrices scores and 
the Stanford-Binet IQ’s was higher than any of the correlations between the Mat- 
rices scores and W-B 1Q’s and weighted scores. Among the correlations between 
Matrices scores and W-B IQ’s and weighted scores, the highest found was that for 
the Full Seale 1Q’s. The correlations between Matrices scores and the W-B weighted 
subtests scores ranged from .60 (Block Design) to .29 (Arithmetic). 
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NEW USE FOR THE VIGOTSKY BLOCKS 
ELEANOR O. MILLER 
Illinois College 


The Vigotsky Blocks, usually referred to as the ‘concept formation test”, have 
been used in a number of ways and for a number of purposes. Sometimes they are 
considered as a concept formation test for which there is only one correct solution. 
In other cases they are said to have several solutions, more or less acceptable. It is 
our purpose to show their usefulness as a part of a more general “phenomenological 
approach’ to personality characteristics, particularly in regard to thinking. 

There are 22 blocks of five colors, of six two-dimensional shapes, and of two 
different heights. The subject is asked to divide the blocks into four groups. The one 
“correct” solution involves the concept of volume. The blocks are of two heights 
and each such group can be subdivided into two distinct groups by area alone. It is 
for this solution that the nonsense syllables are printed on one side of each block. 
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In addition to this one solution, Fosberg®) states that he has received over fifty 
different solutions. Rapaport lists a number of “usual and acceptable” attempts at 
solution as well as a number of unusual attempts. Kasanin and Hanfman“: 5 ®, 
who have used the test primarily for testing schizophrenic thinking, offer a method of 
using the blocks in a formal and objective manner with the one correct solution of 
volume. More recently, Vinacke has reviewed the use and theoretical aspects of the 
test. 

We have used the blocks as a means of studying methods of thinking with a 
varied group of subjects, including both college students* and patients in a private 
mental sanatorium.** Five of the cases will be presented briefly here. 

The blocks are given to the subjects with instructions to divide them into four 
groups so that there is some common factor among all the blocks in one group and 
so that each group is different from each other group. The nonsense syllables are 
concealed on the reverse side of the blocks. 

Consider the case of Mary, a junior in college, a good but not superior student. 
She first separated the blocks into two groups according to height and then sub- 
divided them into two groups: those which were round and those which were angu- 
lar. Her second solution was a similar one of shape and number of sides. She then 
briefly considered color but gave it up. The third solution was a continuation of the 
first solution but after dividing the blocks by height, she separated them into the 
larger and smaller ones. This is the “correct”’ Vigotsky solution. When urged to try 
for another solution, Mary made a separation into triangles and almost triangles, 
circles and almost circles, semi-circles, and squares. She then gave up the attempt to 
find other solutions. 

This group of solutions shows a way of thinking in which there is a gradual ap- 
proach to the volume concept, one part of the concept discovered first and the 
second part following after the intervening errors. It is Rapaport’s contention “) 
that the discovery of the subject’s thinking methods on this test gives us the clue for 
interpreting his methods of thinking in general. Were we to characterize Mary’s 
thinking, it would seem to be the obvious step-by-step solution, using simpler con- 
cepts to build up more complex ones. 

John, another junior, obtained this same solution by a different method. He 
studied the blocks for nearly two minutes and then with no errors and no hesitation 
moved them into the four groups of division by volume. When asked if he could think 
of other solutions he made an apparent effort but discovered nothing different. This 
solution may well be described as “solution by insight”? and probably occurs only in 
those whose thinking—as indicated by the forming of concepts—is superior. At the 
same time, there is a rigidity here which permits none of the other solutions, even 
though inferior, which Fosberg and Rapaport report. 

The forming of abstract concepts requires a fairly high degree of intelligence, 
education, and mental maturity. The use of these blocks with schizophrenic patients 
proceeds on the assumption that these patients have lost some of the ability to form 
such concepts. We must be sure, however, in using this test with them that they have 
previously displayed abilities we now assume they have lost. We have used the test, 
therefore, primarily with patients whose previous history reveals a better than aver- 
age intelligence and education. One 34 year old patient who was a former WAC 
posed an interesting problem in diagnosis. Her social history suggested a psycho- 
pathic personality but there were also hints of the beginning of a schizophrenic pro- 
cess. When asked to sort the blocks, the patient first divided them, according to 
shape, producing what Rapaport ®) describes as “usual and acceptable” solution. 
The second solution, using color, was an “unusual solution.” The patient described 
this solution as one in which “the colors blend together.” The groups were “orange 
and blue for fall,” “‘orange and gray for spring,” ‘yellow and gray for summer,” and 
“yellow and white for inside the house.” Beginning with the third solution there 


* All students at Illinois College, Jacksonville, Illinois. 
**All patients at the Norbury Sanatorium, Jacksonville, Illinois. 
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were only fantasy solutions (Rapaport’s fabulations). Solution 3 was an amusement 

ark with a merry-go-round of circular and hexagonal blocks, various shaped blocks 
or other rides, angular blocks which represented “science and things”, and a final 
group representing “knock-down bottles and things you shoot at.” 

In the fourth solution the fantasy took the form of a factory. Here the round 
blocks were wheels, the angular ones blueprints, small ones of various shapes were 
nuts and screws, and larger ones were hammers and scrapers. In the fifth solution 
the fantasy was the sky wherein the circular blocks were the moon, earth and sun; 
small yellow and white ones the milky way; larger light colored blocks were clouds 
in the daytime; and the darker colored blocks the clouds at night. Although no 
further solutions were requested it seems quite possible that many other fantasies 
could have been produced. It was evident that the thinking of this patient was col- 
ored by fantasy and that her own accounts of her experiences would have fairy tale 
attributes! One seriously questioned her ability to separate fact from fiction. 

We found test behavior somewhat similar to this in Jim, a college senior, whose 
academic work had been very erratic. After two usual solutions by form and one by 
color in which he put yellow and white into the same group, Jim produced a fourth 
solution in which he described the blocks as building materials. Large tall ones were 
for the “‘base’’; large flat ones were for “covers’’; circular ones were archways and all 
others were windows. This solution closely approximated the solution by volume, 
except for the unreal nature of the terminology. Jim’s thinking was not the “fairy 
tale’ kind, but he seemed to lack the ability to remain long and successfully in the 
realm of the real and abstract. 

When called upon for additional solutions to the first spontaneous ones, some 
subjects try matching edges, examining small details of the blocks. With one hospital 
patient, Bert aged 25, in whom a schizophrenic process was suspected, this was 
particularly noticeable. He proceeded to build towers with the larger blocks at the 
base and carefully matched at least one edge with the others. No matter how often 
he was urged to try another approach, Bert repeatedly returned to the apparently 
regressive technique of piling up blocks. In discussing with Bert his own problems, 
we found it necessary to employ a similar step-by-step process, ‘“block-by-block”’, 
by which we could build up ideas only when one small detail of each fact could be 
“matched” with a small detail of the preceding fact. 

We have presented these five cases to suggest that these blocks can be used 
effectively with the phenomenological point of view. We discover something of the 
individual frame of reference, particularly in regard to the method of thinking, and 
for teaching or counseling, clinical evaluation or therapy, such knowledge can be 
very useful. This in itself will help in the use and interpretation of the more objective 
tests. 
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THE EFFECTS OF MEPHENESIN* AND PRACTICE ON THE 
BENDER-GESTALT PERFORMANCE OF PSYCHIATRIC OUTPATIENTS 


DAVID ROSENTHAL AND STANLEY D. IMBER 
Johns Hopkins University School of Medicine 


PROBLEM 


This study was done as part of a larger investigation reported elsewhere“. 
Previous studies: *. 4. ®& % 1°) had indicated that mephenesin (myanesin, tolserol) 
was effective in relieving anxiety-tension states, hypomanic excitement, agitated 
depressive reactions, and some schizophrenic symptoms, with anxiety and its asso- 
ciated muscular tension singled out most prominently. Mailer“ found no evidence 
that the drug improved mental functioning as measured by the Wechsler-Bellevue 
Scales. Since muscular tension associated with anxiety may reasonably be thought 
to affect performance on a visual-motor task, the Bender-Gestalt test was used to 
measure the drug’s effectiveness with regard to these symptoms. In addition, the 
design of ourgexperiment also enabled us to evaluate the effects of repeated testing 
on patients’ performance of the Bender-Gestalt test. 


METHOD 


Thirty-five outpatients of the Henry Phipps Psychiatric Clinic of Johns Hop- 
kins Hospital participated in the experiment, but only 13 completed it. Our findings 
are based on these 13, whose clinical diagnosis ranged from psychoneurotic reactions 
to the ambulatory schizophrenic types of illness. Patients were interviewed by a psy- 
chiatrist at the start of the experiment and every two weeks thereafter for eight weeks. 
The psychiatrist filled out a check-list of 33 symptoms, each of which was rated on a 
4-point scale of severity. The reliability of the check-list was highly acceptable. 
Following each interview, the patient was given the Bender-Gestalt test and his med- 
ication for the next two-week period. The medication consisted of either placebos, 
3 gms. mephenesin per day, or 9 gms. mephenesin per day, and these were given in a 
randomized sequence so that neither the interviewing psychiatrist nor the patient 
knew what medication was being administered for any two-week period. One of the 
authors administered the Bender-Gestalt tests while the other scored them accord- 
ing to the system of Pascal and Suttell®? without knowing their temporal order. 
Each subject took the test five times. 


RESULTS 
The following specific questions were investigated: 


1. Does mephenesin, in the dosages administered, produce any symptomatic 
improvement? No such improvement was found. 


2. Isa lowering of scores on the symptom check list (indicating a reduction of 
severity of symptoms) accompanied by a lowering of scores on the Bender-Gestalt 
test (indicating improved test performance)? It was noted that patients experienced 
a mild amelioration of symptoms merely as a consequence of participating in the 
experiment ©). This amelioration was not correlated with improvement in Bender- 
Gestalt performance when the effects of practice were partialled out. 


3. Does mephenesin, in the dosages administered, produce any change in 
Bender-Gestalt performance? No such change was found. Special attention was paid 
to the possible effect of the drug on tremor, as manifested on the Bender-Gestalt 
test. Eight of our patients showed scorable tremor on at least one of the five testings, 
but no effect of mephenesin on tremor was found. 


*In the present study, Tolserol (Squibb) was the preparation used. We are indebted to the Squibb 
Institute for Medical Research for making this material and matching placebo tablets available to us. 





THE EFFECTS OF MEPHENESON AND PRACTICE 91 


4. Does repeated testing with the Bender-Gestalt tend to improve the perform- 
ance of patients who do not show a parallel clinical improvement? Addington 
found a trend toward such improvement after a single retest in both schizophrenics 
and normals, although to a statistically significant extent only for the former. How- 
ever, he had no correlative information regarding whether the schizophrenics had 
improved clinically between the testings. Moreover, he seems to have had knowl- 
edge regarding which was the initial test and which was the retest for at least some 
of his patients, a factor which could have readily influenced the scoring of the tests. 
The five test trials of our 13 patients were evaluated by analysis of variance. Differ- 
ences between trials were significant at the 5% (closely approaching the 1%) level 
of confidence, confirming the notion that simple practice alone, without concomitant 
clinical improvement, can improve the Bender-Gestalt performance of psychiatric 
patients. This raises the interesting question of whether any kind of drawing practice 
might not affect Bender-Gestalt performance, and if so, how this factor can be taken 
into account in the clinical evaluation of test records. It should be added, however, 
that not all patients improved their test performances with practice, and that the 
learning curve varied considerably from patient to patient, some patients even 
reversing the direction of the learning curve. The mean scores per trial for the entire 


group and some examples of learning curves for individual patients are shown in 
Table 1. 


Taste 1. Errects oF PracticE oN BENDER-GESTALT PERFORMANCE 
BY PsycHIATRIC OUTPATIENTS 








Patient Trial 
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A 43 
B ‘ 14 
C 30 27 





Mean for entire Group ‘ 34.4 








SUMMARY AND CONCLUSIONS 


1. Thirteen psychiatric outpatients showed no clinical improvement or im- 
provement on the Bender-Gestalt test as a result of taking 3 or 9 gms. mephenesin 
daily for two-week periods. 


2. Mild symptomatic amelioration resulting from participation in the experi- 
ment showed no correlative changes on Bender-Gestalt performance. 


3. Repeated administrations of the test led to improved performance gener- 
ally, but learning curves varied considerably among the patients. Knowledge of 
such a practice effect should make us more circumspect in our clinical evaluations of 
test records. It also raises interesting questions as to which patients profit by prac- 
tice, which do not, why they do or do not, and whether the capacity for such im- 
provement is associated with other clinical variables, as perhaps the potential res- 
ponse to psychotherapy. 
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THE VALIDITY OF A SCHIZOPHRENIC SCREENING SCALE 
OF THE MMPI* 


HERBERT QUAY AND JOHN T. ROWELL 
Milledgeville (Georgia) State Hospital 


PROBLEM 


In an attempt to improve the diagnostic efficiency of the Minnesota Multi- 
phasic Personality Inventory (MMPI), Benarick, Guthrie, and Snyder“ developed 
a critical item scale to differentiate schizophrenics from non-psychotics with elevated 
Schizophrenia (Sc) scores. This scale consisted of eleven items answered more often 
in the scored direction by the schizophrenics. They also obtained an additional ten 
items which were answered in the scored direction by the non-psychotics. A recent 
study by Rubin indicated that the eleven-item scale had some validity in differ- 
os psychotics from non-psychotics, but its use was limited for hospitalized 
subjects. 

Gough ® has suggested a modification of the original scale, which he has called 
the Schizophrenic Screening scale (Sr).1_ This modification consisted of dropping 
one item from the eleven-item scale and adding the ten-item scale, with the scoring 


of the latter reversed. This study is concerned with the efficiency of this twenty- 
item Sr scale as a screening device. 


PROCEDURE 


Subjects were 84 hospitalized schizophrenics, 137 hospitalized psychiatric pa- 
tients with diagnoses other than schizophrenia, and 92 affiliate student nurses. Both 
patient groups had received the MMPI soon after admission. Diagnoses were made 
by the psychiatric staff after a period of observation. The student nurses were ad- 
ministered the MMPI for this research. All subjects were white females. Means and 
standard deviations were computed for all groups. Biserial correlations were ob- 


tained as a measure of the relationship between high scores on the scale and schizo- 
phrenia. 


RESULTS 


Scores obtained by the three groups, with their means and standard deviations 


may be found in Table 1. It is obvious that there was a great deal of overlap among 
all three groups. 


o a authors wish to express their appreciation to Dr. Anders Sweetland for a critical reading of 
e text. 


1Gough, H. G. Personal Communication. April, 1954. 
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The biserial correlation obtained between the two patient groups was .24 
(+.08). For the schizophrenic group and the nurses, it was .82 (+.09). These indi- 
cate little tendency for high scores to be obtained by the schizophrenics. 

A cutting score that would be of practical value to the clinician in differentiating 
the three groups could not be found. For example, a score of ten, which identifies 
only 63% of the schizophrenics, yields 44% false positives for the other patient 


group and 40% false positives for the student nurses. 


Discussion 


It would appear that although the scale differentiates to a statistically signifi- 
cant degree between the three groups, its practical value is limited. 

However, the relative unreliability of psychiatric diagnoses should be pointed 
out. It is quite possible that a follow-up evaluation would find some schizophrenics 
in the other patient group, and vice-versa. One would not expect this to be the case 
in regard to the student nurses, however. 


SUMMARY 


Gough’s modification of a Schizophrenic Screening scale of the MMPI originally 
developed by Benarick, Guthrie, and Snyder was tested for its differentiating ability. 
Eighty-four hospitalized schizophrenics, 137 hospitalized psychiatric patients with 
diagnoses other than schizophrenia, and 92 student nurses were used as subjects. 
All were white females. Although significant statistical differences were obtained, 
the overlap was too great for the scale to have adequate practical value. 
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A NOTE ON VALIDITY, CONSENSUS AND GROUP STANDARDS 
J. F. MORRIS 
University of Manchester (England) 


In recent discussions of the reliability and validity of assessments of personality, 
it has all too often been assumed that high reliability of judgments by a group of 
trained observers has guaranteed high validity. This assumption is tenable only 
under rather special conditions, and several studies—the Kelly-Fiske report being 
the best known—have shown that these conditions are often not present in assess- 
ment procedures. 

Kelly, in a review article“) comments that ‘The curious state of affairs wherein 
the most widely (and confidently) used techniques are those for which there is little 
or no evidence is indeed a phenomenon appropriate for study by social psychologists. 
This reviewer can only assume that in the absence of evidence of their predictive 
validity, such techniques must serve an important function other than assessment”’. 
The purpose of this note is to suggest that much of the research work carried out by 
social psychologists on group consensus is relevant to the proceedings in assessment 
groups. An examination of the factors that seem to be related to high degrees of 
agreement shows that many of them are irrelevant to the goal of valid assessment. 
The principal factors are: 

1. The attractiveness of the group for its members. Consensus tends to be reached more readily 
when the members of the group place value upon their membership. 


2. The presence of pressures to arrive at a unanimous decision (e.g. time limit set by external 
authority or institution for which the assessment group works). 

3. The structure of authority within the group. Where there is a hierarchy of authority, con- 
sensus tends to be centered around the judgments of the leader. 


4. The need for conformity of group members in particular kinds of situations. This need may be 
related to high levels of anxiety which, under certain circumstances, are reduced by group 
agreement. 


5. The degree of discussion on the topic relevant to consensus. Other things being equal, dis- 
cussion—by pooling the factors thought to be relevant by group members—is positively linked 
with agreement. 


6. The degree to which group members share a common “frame of reference’, and the relevance 
of this common-value element, to various tasks faced by the group. 


A factor frequently responsible for lack of validity in group consensus is the 
ambiguity of the group goal. Distortions readily occur when the goal is ill-defined 
and certain members of the group (especially if they are respected members) commit 
themselves to a definite judgment. Underlying all these factors is the general tend- 
ency towards agreement as a condition of the existence of a group. The very fact of 
belonging to a particular group tends, over a period of time, to produce uniformities 
of feeling, of selective perception, of weighting, in the members. As would be ex- 
pected from the general principle of inter-dependence among social variables, the 
factors enumerated above are both conditions and consequences of agreement. 

One of the fullest statements of a group-rating procedure is to be found in the 
OSS report ‘“‘Assessment of Men’’®). The following extracts from the report clearly 
show the presence of each of these factors in the OSS group conferences: 

1. Attractiveness of group. ‘For most of those engaged in this undertaking the whole experience 
was an exceedingly happy and rewarding one. Besides the essential feeling that we were for- 
warding the success of OSS activities and in this way contributing to the defeat of Fascism, 
there were countless satisfactions to be derived from working as members of a congenial stimulating 
dedicated group’. (P. 6, my 1talics) 

2. Pressures to unanimity... . “Occasionally the discussion (on rating) got out of hand, and inter- 
vention was required to bring the disputants back to the topic and to a satisfactory conclusion; 
but, in ae this, the chairman was supported by the unanimous realization that many 
critical decisions had to be made within a limited period of time’’. (p. 218, my italics) 

Structure of authority. ‘The ratings in situations were made by each team member independ- 
ently, although communication between members during the course of the test was not forbid- 
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den; but as soon as (a situation was completed) the team observers met in conference with the 
sheets bearing their own rating, one member bringing with him also a blank sheet on which to 
record the final pooled scores. This man usually would act as chairman of the group. Frequently 
though not always, the senior or the most dominant staff member assumed the role. It was im- 
portant who took it, for the personality of the chairman may well have been a significant determinant 
of the final combined rating’’. (p. 205, my italics) 

4. Need for conformity. “Forces inherent in social relations generally—mutual likes and dislikes, 
bids for dominance and prestige, tendencies to yield too readily to group pressure were not with- 
out effect on our proceedings”. ‘- 2°) 

5. Degree of discussion ‘The purpose of the meeting (of raters) was to replace the several sets of 
ratings ... by one set of ratings (for each candidate in a particular situation). This set of ratings 
was to represent the best judgment of the whole team—ideally each member should be willing 
to subscribe to it. With this aim in view, the disagreements among raters—which we shall see 
later were not great—were not dealt with by merely averaging the individual scores but rather 
were resolved by means of discussion’. ‘®- 2°6) 

6. Common frame of reference. “In all cases the exchange of opinions served to inform each ob- 
server of the criteria of evaluation used by the others and thus contributed to a gradual forma- 
tion of a frame of reference shared by the whole team’’. ‘: 2°) As for the ambiguity of the group 
goals, we find the following comment made: “‘. . . since the job analyses and environment an- 
alyses were unavoidably inadequate for assessment purposes, our conceptions of the different 
roles were extremely vague. Consequently, in assessing each candidate we had nothing specific in 
mind with which to compare his performance’. (pp. 393-394, my italics) The great frankness and cour- 
age of these self-assessments by the OSS team make clearer some of the factors which in prac- 
tice led to agreement on judgments. To what extent they were responsible for the generally low 
validity of the assessments is hard to say. The OSS staff considers much of the difficulty to lie in 
the inadequacy of the appraisals (criterion-ratings), which were in their turn doubtless subject 
to similar group factors. Since the importance of these factors was at no time evaluated, how- 
ever, it is somewhat strange to find the authors stating that “our hypothesis is that individual 
judgments made before listening to a group discussion (between judges) are generally less valid 
than individual judgments made after listening to a group discussion. The chief reason for this 
would seem to be that group discussion brings into each man’s sphere of reflection more facts 
and more interpretations than were there before.”’ ‘°: ®) There is explicit recognition of social 
group influences operating to influence decisions but the group believes that “in a group of five 
or more, which is controlled in a democratic fashion, these factors become less significant as time 
goes on .... There is a tendency for these determinants to balance out, leaving diagnostic ability 
as the chief factor in deciding the course of group opinion’’. ‘ *§) No evidence is offered for 
these conclusions, although this optimistic finding would seem to need some justification. 


The Kelly-Fiske findings show that confidence in a judgment is no guarantee of 
its validity, and that added information may hamper rather than help adequate 
assessment, even though the judge feels the information gives him added insight or 
understanding. Under what conditions then, could we expect group consensus to 
give greater validity than individual judgments? The following factors seem to be 
important. 

(a) When errors known to result from individual biases are pointed out and cor- 
rected by other individuals in the group. This supposes that criteria of error are 
established and that they could in principle be used by the individual judge but 
that he makes mistakes which can be pointed out by others free from error in this 
direction. 

(b) Where a valid assessment requires the collection of a large mass of data 
beyond the power of any one person to assemble in a practicable period. Here the 
assessment task is broken down into a number of specialist performances which may 
then be integrated into a single rating. This assumes that the integrative principles 
are explicit and known. 

(c) Where group members who explicitly or intuitively make judgments of 
higher validity than others act as foci towards which group judgments converge. In 
principle, just as a highly discriminating item in a test schedule can be used alone 
without loss of validity, so these “valid judges” could be used as individuals. It may 
be objected of course, that this analogy is misleading, and that a ‘“‘good judge” may 
derive his skill from participating in a group conference at which he can hear diverse 
points of view, which he then proceeds to integrate more adequately than others. 

The only comment that can be made on this viewpoint is that the diverse points 
of view heard at a group conference are just as likely to confuse as to assist the good 
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judge. The structure of assessment groups varies largely, but there are often marked 
differences in seniority between group members. In these circumstances all kinds of 
considerations extraneous to the goal of accurate assessment are free to enter. The 
finding of many social psychologists that group judgments tend to converge towards 
the mean of the judgments of the individual members is prebably limited to transi- 
tional groups in which the members are of equal status. When the convergence is 
towards the judgment of the senior members it is not necessarily towards greater 
validity. As Holt and Luborsky comment® “. . . there is no obvious relation be- 
tween the validity of an interviewer’s ratings and his psychiatric experience, reputa- 
tion, age, or the length of time he has been doing the job of selecting candidates’. 
These writers however found that the mean of three interviewers’ independent judg- 
ments had higher validity than the judgments of individual interviewers. It would 
have been interesting to see if pooled ratings arrived at by discussion among the 
interviewers would have improved the validity of the assessments. 

It is not in any way maintained that the consensus-producing factors listed here 
are the only factors of importance in assessment groups. Emphasis has been placed 
upon them because their influence has been relatively neglected in assessment pro- 
grammes, and the intensiveness and homogeneity of training of clinical psychologists 
is likely to increase the effect of group factors, not least those group factors which are 
carried around in the heads of supposedly independent judges. 
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CORRECTION 





Dr. William Schofield wishes to make the following corrections to his annual 
review article ‘Research in Clinical Psychology: 1953” which appeared in the July 
1954 issue of this Journal: 


On page 211, in the review of the Hovey and Stauffacher study, the 
words ‘mechanical’ and ‘clinical’ were interchanged in the third from the 
last sentence. As corrected the sentence reads, ‘‘Using the most critical of 
two levels of criterion, the number of ‘hits’ to ‘misses’ was 5.8:1 for the 
clinical predictions but only 2.1:1 for the mechanical method.”’ 


Due to a mis-reading of tabular material, the study by Berger (J. 
Consult. Psychol., 1953, 17, 355-358) reviewed on Page 207, was erroneously 
criticized for a lack of statistical support for the author’s conclusion. The 
oa Values reported were mis-read as standard errors or mean differences 
when they were actually only sigmas of difference distributions. Con- 
sequently Berger correctly interpreted twelve of the fifteen Rorschach 
variables studied as yielding reliable mean differences between his stress 
and control groups. 











EDITORIAL OPINION 





REORGANIZATION OF THE DIVISIONAL STRUCTURE OF THE 
AMERICAN PSYCHOLOGICAL ASSOCIATION 


Members of the APA will soon be called upon to express their opinions of a 
proposed plan for the reorganization of the divisional structure of the APA. Some 
type of reorganization appears necessary due to the organizational complexity in- 
cident to the large number of divisions already established with the possibility of 
many more to come. The rapid development of applied psychology, in particular, 
into many other fields of specialization such as penal psychology or psychology in 
engineering has resulted in increasing pressures for the establishment of additional 
divisions representing their specialized interests. The existing divisional structure 


promises to become so unwieldy as to seriously handicap the general functioning of 
the APA. 


Several possibilities are open for dealing with this problem of a too cumbersome 
organizational structure. First, the existing plan of organization might be retained 
by increasing the administrative apparatus necessary for handling more divisions. 
With minor changes of procedure such as, perhaps, holding annual meetings of larger 
divisions separately from the general annual convention, we could go on indefinitely 
under the existing plan. Second, the existing number of divisions could be reduced 
materially by amalgamating existing divisions with overlapping interests into a few 
large divisions. The resulting large divisions would be organized along the lines of 
major fields of interest with specialized groups being allowed to make their own ar- 
rangements for special meetings. This alternative would probably be protested 
against by members of smaller divisions who would lose representation and general 
APA recognition of their specialities. Finally, a third alternative would retain the 
APA as the general administrative and promotional organization representing pro- 
fessional psychology and would encourage the establishment of organizationally 
independent but professionally affiliated associations representing any groups of 
specialists who may care to organize themselves. Proponents of this latter plan argue 
that independently organized specialty groups could better carry on specialized 
functions such as upholding membership qualifications, policing their fields, pro- 
moting research or special applications, and other administrative functions. 

A review of recent history will clarify some of the issues at stake. The schism 
about twenty years ago of applied psychologists away from the APA to form an in- 
dependent AMERICAN ASSOCIATION OF APPLIED PsyYCHOLOGISTs represented an at- 
tempt of applied practitioners of psychology to secure proper recognition, representa- 
tion and right to promote their special fields of interest. At first, the AAAP was 
received with a great deal of enthusiasm but within 10 years it was perceived as not 
fulfilling its purposes and there arose pressure to repair the fission of professional 
psychologists by amalgamating the APA and the AAAP into one large group repre- 
senting the interests of all under the existing plan of organization. In reviewing the 
failure of the AAAP, it is now seen that its organization was premature and did not 
make proper provision for the interests of special groups within the applied field. 
While in 1936 there were not numerically large groups of applied psychologists, their 
professional interests were so diverse as not to be amalgamated easily into one 
organization such as the AAAP. The AAAP turned out to be too much like the par- 
ent APA, being an unnatural combination of many applied groups whose interests 
were still too diverse to function harmoniously within one organization. We must 
conclude that the AAAP was a move in the right direction but that it did not go far 
enough in satisfying the diverse specialized needs which were the reason for its 
organization. 
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It appears to us that the basic issue is one of size in relation to the multiplicity 
of needs and functions which characterize a large professional organization. The 
problems of the APA are very similar to those of the AmericAN Mepicat Assocta- 
TION, the evolution of whose organization establishes a precedent. In the 19th 
century when the number of physicians exceeded 50,000 and rapidly approached the 
present number of 150,000, the AMA became so large that it could function efficient- 
ly only in relation to general problems relating to the administration, promotion, 
organization and public relations of the profession. The membership was so large 
that meetings could accommodate only a fraction, and the actual mechanics of carry- 
ing out functions had to be delegated to state, county and specialty group levels of 
organization. Scientific programs could be organized only on a special interest basis 
touching only on the most timely topics, and no specialized functions for special 
interest groups could be carried out under general AMA auspices. Under these condi- 
tions, the AMA became the general representative of the medical profession with 
membership open to all licensed physicians who could qualify for membership in 
county medical societies. Strong independent but affiliated specialty organizations, 
such as the AMERICAN PsycHIATRIC ASSOCIATION, arose to fulfill the specialized needs 
of groups of specialists. In time, the major specialties themselves became fraction- 
ated to fit the needs of specialties within specialties. Within this evolutionary trend, 
the activities of the AMA became more general and the activities of the sub-special- 
ee more specialized, thus serving all general and specific needs of the pro- 

ession. 

The APA is rapidly reaching a size where some fractionation is highly desirable. 
Only a few cities are capable of handling conventions with the current APA member- 
ship of around 12,500. Administrative activities under the existing plan of divisional 
organization are so cumbersome that council representatives and key officers are 
required to spend almost the entire convention period dealing with business matters. 
The conventions are becoming so large and unwieldy as to lose the personal flavor of 
earlier years when the APA was of manageable size. It bas become necessary to limit 
the attendance at advertised programs by excluding those who did not previously 
arrange for tickets, and more of these ‘‘closed”’ meetings are in prospect. We have 
already reached the size of a prehistoric monster no longer able to maintain itself 
efficiently. 

Under such conditions, it is inevitable that major specialty groups will begin to 
fractionate off from the parent APA. Any plan for reorganization of the APA should 
begin to make provision for recognition of affiliate specialty organizations which will 
take over many functions now executed by the APA. The need for a separate spec- 
ialty organization is already acute in the field of clinical psychology which is the 
fastest growing and potentially largest specialty field. It is just a matter of time when 
we will have an AMERICAN ASSOCIATION OF CLINICAL PsYCHOLOGISTs or a SOCIETY 
oF INDUSTRIAL PsyYCHOLOGISTs. 

F. C. T. 
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